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20CAGHA 

CTGCAGGCCAGCGTCCTGATAAGTGAATTCGCCGCCAC CATOGGAGOCCCAC 
CGTCAACCCCCCAGCAGCAGCAACAbcAGCAGCAACAGCAACAGCAGCAGC 
AACAACACrAOCAGCAACAGACTAGTCGTACGTATCCCTATGACGTGCCCGA 
CTATGCG TAG 

127CAGHA 

ctgcaggccagcgtcctgataagtgaattcgcccccac catgggaggcccac 

cgtgaaccccccagcagcagcaacagcagcagcaacagcaacagcagcagc 

aacaacagcagcagcaacagcaacagcagcagcaacaacagcagcagcaac 

agcaacagcagcagcaacagcagcagcaacagcaacagcagcagcaacaac 

agcagcagcaacagcaacagcagcagcaacaacagcaacaacaacagcaac 

agcagcagtaacagcagcagcaacagcaacagcagcagc-aacaacagc^agc 

agc:aacagcaacagcagcagcaacaacagcagcagcaacagcaacagcagc 

agcaacagcacicagcaacagcaacagcagcagcaacaacagcagctgcaac 

agcaacaocagcagcaacaacagcagcagcaacagactagtcgtacgtatc 

cciatgacctgcccgactatgcgt ag 
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20QHA 

MGGPPSTPQaorSRTYPYDVPDYA 



127QHA 

MGGPPSTPQ,2,TSRTYPYDVPDYA 



Figure 1. A) DNA sequences of 20QHA and 127QHA and B) their predicted protein 
sequences. The protein-coding region is underlined. The Kozok sequence is in italic. 
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Figure 1. P-elemem plasmid constructs for production of transgenic flies. 
Each construct has two P-elements for chromosomal insertion. To facilitate 
identification of transformed flies, a miniwhite gene is included to produce 
red pigmentation in the eye. A) Plasmids carrying the full-length cDNA 
encoding the fly PROSPERO with various CAG repeat sizes. The expres- 
sion of PROSPERO is regulated by five tandem upstream activating 
sequences (UAS). The yeast transcription factor GAL4 activates the tran- 
scription from these UAS elements. At its 3'-end, /7ros/?ero cDNA is joined, 
in-frame, to a short DNA sequence that codes for a heterologous epitope, 
hemeagglutimn (HA). Antibodies against HA will be used to label the pro- 
tein in immunohistochemical assays and Western blots, B) Plasmids carry- 
ing a partial cDNA encoding 422 amino acids of the C-terminal end of 
PROSPERO with various CAG repeal sizes. C) Plasmids carrying a DNA 
sequence that only encodes polyglutamines of various sizes. D) Plasmids 
carrying a DNA sequence that only encodes polyglutamines of various 
sizes, expressed under the control of one, two or five GLASS response ele- 
ments (IGR, 2GR, or 5GR). The eye-specific protein GLASS activates the 
expression of polyglutamines from the GLASS response elements. 
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Generation of the P-element insertion and screening for modifiers 

M P[A2-3]/P[A2-3] X F EP55/EP55 
i 

M EP55A^;; P[A2-3]/+ X F w/w 

M w/Y;pEP/+;+ or wA^;+;pEP/+ X F w;GMRyCyO;127Q/127Q 
Progeny screened for eye phenotype 

Isolation of the new P-element insertion (pEP = suppressor or enhancer) 

M (GMR;127Q)/pEP X F (CyO;TM3)/Xa 
i 

M GMR/CyO;pEP/TM3 X F wlll8 
i 

M GMR;TM3 or CyO;pEP X F w;GMR/CyO;127Q/127Q to test 

X F (CyO;TM3)/Xa to establish line 

i 

M +/CyO;pEP/TM3 X F +/CyO;pEP/TM3 
i 

pEP/TM3 or pEP/pEP established lines 

Genetic scheme used for generating P-element mutants, screening for modifiers 
of polyglutamine toxicity, and isolating a hypothetical modifier P-element insertion on 
chromosome 3. Homozygous EP55 virgin females were crossed with males homozygous 
for a defective transposon, expressing the transposase. The Fl male progeny were crossed 
with virgin wlll8 females. The F2 Male progeny that had coloured eyes and lacked the 
transposon's genetic markers were selected, as they contain a new stable insertion on an 
autosomal chromosome. These males were crossed with flies heterozygous for GMR- 
GAL4 on chromosome 2, balanced by CyO chromosome, and homozygous for UAS- 
127Q on chromosome 3. The resulting F3 progeny were screened for eye phenotype. 
Once a modifier was found, a single male was crossed to female (CyO;TM3)/Xa. The 
resulting male progeny were crossed to wlll8 flies to separate the P-elements. This 
resulted in colored-eye progeny that carry a balancer for one chromosome and a P- 
element on another. Males from such progeny were tested for modifier activity by 
crossing to female w;GMR/CyO;127Q/127Q. The lines were established by crossing the 
latter males to (CyO;TM3)/Xa, and by crossing the resulting flies carrying CyO and TM3 
balancers. EP55: source of transposable P-element; P[A2-3j: source of transposase; F: 
female; M: male; CyO: balancer chromosome 2; TM3: balancer chromosome 3. Xa: 
translocation (2;3) Xa. (Chromosome 4 is omitted.) 
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FIGURE 9A 



dTPR2 Protein 508 amino acids 

MDDEVIEISDSEREETSSNSEMDVEITTEQPTIDVKAEQIVPKDAATIAEEKKKLG 

NDQYKAQNYQNALKLYTDAISLCPDSAAYYGNRAACYMMLLNYNSALTDARH 

AIRIDPGFEKAYVRVAKCCLALGDIIGTEQAVKMVNELNSLSTAVAAEQTAAQK 

LRQLEATIQANYDTKSYRNVVFYLDSALKLAPACLKYRLLKAECLAFLGRCDEA 

LDIAVSVMKLDTTSADAIYVRGLCLYYTDNLDKGILHFERALTLDPDHYKSKQM 

RSKCKQLKEMKENGNMLFKSGRYREAHVIYTDALKIDEHNKDINSKLLYNRALV 

NTRIGNLREAVADCNRVLELNSQYLKALLLRARCYNDLEKFEESVADYETALQL 

EKTPEIKRMLREAKFALKKSKRKDYYKILGIGRNASDDEIKKAYRKKALVHHPD 

RHANSSAEERKEEELKFKEVGEAYAILSDAHKKSRYDSGQDIEEQEQADFDPNQ 

MFRTFFQFNGGGRNNSSFNFEF 
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dTPR2 cDNA 2239 base pairs 

GGCACGAGCCACTACTTCGCATGGCACGCTTTTTTCCGTGTGCTCGGTTCGTT 

CGGCCATACAAAACACAAAATTCAAGTTTAAAAACTAAATAGGCAACTAAAA 

GGGAAGCCGCAGCGAATAAAGTGATTTGCTGAAAGAGACGTAAGAAAGTTA 

ATCGCATCGAAGGCACCAGAAATCGGGGATTTCTAACACGGCGCGCGTGCGA 

CGTACATACATACGCAAGCGCACACACACACGAACAATTACTTGCCATTGAC 

GCAAAAGCGAAAAAGCAGTGGAATAAAGGGGAATTGACAAATAACAACGTT 

TTGCAAGCACTGGACTCTGGTCGCTGGTGTTCTTTCATTTTGTAATTGCCACG 

CATGGACGACGAAGTAATTGAAATTAGCGACAGCGAACGCGAAGAAACCTC 

ATCGAACTCCGAAATGGATGTGGAAATAACGACAGAACAGCCAACCATCGAT 

GTCAAAGCAGAGCAAATTGTGCCCAAGGACGCGGCAAGCATTGCCGAGGAG 

AAGAAGAAACTGGGCAACGACCAATACAAGGCGCAGAACTATCAGAATGCA 

ctcaagctctacacggatgccatatcgctgtgtccggactcggcggcatacta 

tggcaatcgggccgcctgctacatgatgctgctcaactataatagcgccctg 

accgacgcccgacacgccatacgcatcgatccgggcttcgagaaggcctacg 

tccgtgtggccaagtgctgtctggccctgggcgacattattggcaccgaaca 

ggccgtcaaaatggtcaacgagctgaattcgcttagcacggctgttgctgcc 

gaacagacggcggcgcaaaagttgcgccaattggaggccaccattcaggcg 

aactacgatacgaaatcctatcgcaatgtggtcttctatttggatagtgcctt 

gaaattggcgcccgcctgtttgaaatatcgtctactcaaggctgagtgccttg 

catttttggggcgatgtgatgaggccttggacattgcggtcagtgtaatgaa 

actggataccacatcggcggatgcgatatacgtgagaggtctgtgcctgtac 

tacacggacaacctggacaagggaattcttcatttcgagcgcgccctgaccc 

tcgacccggaccactacaagtccaagcagatgcgcagcaaatgcaagcagct 

caaggagatgaaggagaacggcaatatgctattcaagtcgggtcggtatcgc 

gaggcacacgttatctacacggacgccctgaagatcgatgaacacaacaagg 

atatcaattcgaaattgctttacaatcgggctttggtcaacacgcgtattggc 

aatttgcgagaggccgtggccgattgcaatcgagtgctggagctgaatagtc 

agtatctgaaggctctgttgctgcgagcgcgctgctacaatgatctggagaa 

gttcgaggagtcggtggcggactatgagacggcgctgcagctggagaagac 

gccggagattaagcgaatgctgcgcgaggccaagtttgcgttgaagaagtcg 

aagcgaaaggactactacaagatcctgggcattggacgcaatgcgtccgacg 

acgagatcaagaaggcgtatcgcaaaaaggcgctggtacatcatccggatcg 



FIGURE 9B (Continued) 



ACACGCAAACAGCAGTGCCGAGGAGCGCAAGGAGGAGGAGCTCAAGTTCAA 

GGAGGTGGGCGAGGCGTACGCCATACTGTCGGATGCTCACAAGAAGTCGCGC 

TACGACAGCGGCCAGGATATCGAGGAGCAGGAGCAAGCCGACTTCGATCCG 

AATCAAATGTTCCGCACATTCTTCCAATTCAACGGCGGTGGCCGGAATAATTC 

ATCGTTCAACTTTGAGTTCTAGGATCCCAACGAGTGTTGTTCACCACCACAGA 

GAAGAAGACCATCTCAATCCCATACTTTCTGCCTCATCCGAAACCAACATAC 

AGCAGCGCACAAATTTTGAACTCTTTTACATATTTCTTTTCCAAAAAGCAAGA 

AAATACCACATTTTGATTATGTTAACGAATGAATATATGCCAAGTTATTTGAA 

AAAATATTCTAAATCAAAATAATGCAACTAAATTTCCAGTGTAAGTTCACATT 

TTTAAATGTTCTTTCTTGGATTTTTTTTTCGGCAACATTAATAAATCATGGGAG 

ATTTGTGTTAAATAAACAGAAATATACATATAAAAAAAAAAAAAAAAAAAA 
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dMLF Protein 273 amino acids 

MSLFGALMGDFDDDLGLMNNHMNHTMNAMmQMRSMNRLMNSFMPDPFMQ 

VSPFDQGFQQNALMERPQMPAMPAMGLFGMPMMPNFNRLLNADIGGNSGASF 

CQSTVMTMSSGPDGRPQIYQASTSTKTGPGGVRETRRTVQDSRTGVKKMAIGHH 

IGERAHIIEKEQDMRSGQLEERQEFINLEEGEAEQFDREFTSRASRGAVQSRHHAG 

GMQAIMPARPAAHTSTLTIEPVEDDDDDDDDCVIQEQQPVRSSAGRHYSSAPTAP 

QNRYNY 
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dMLF cDNA 1753 base pairs 

GGCACGAGGAAAATATTCGTGAAAATTCTGCATACGGAAAGAAGAAAATTC 

GAGCAACAGAAAGCCAACACAATCCACAAAAATGTCTTTATTCGGAGCGTTG 

ATGGGTGATTTCGACGACGATCTCGGCCTTATGAACAACCACATGAACCACA 

CTATGAACGCGATGAACATGCAGATGCGCTCGATGAATCGCCTGATGAACAG 

CTTTATGCCCGATCCCTTCATGCAGGTCTCGCCCTTTGACCAGGGATTCCAGC 

AGAACGCTCTCATGGAGCGTCCGCAGATGCCGGCCATGCCAGCCATGGGCCT 

CTTCGGCATGCCCATGATGCCAAACTTTAATCGCCTGTTGAACGCTGATATTG 

GTGGCAATTCAGGCGCATCCTTCTGCCAGAGCACCGTGATGACCATGTCATC 

GGGTCCCGATGGGCGTCCTCAGATCTACCAGGCCAGCACTAGTACCAAAACA 

GGACCGGGAGGCGTTCGTGAGACCCGCAGGACGGTGCAGGACTCGCGCACT 

GGGGTGAAGAAGATGGCCATTGGTCATCACATCGGCGAGCGGGCACACATTA 

TTGAGAAAGAGCAGGACATGCGCTCAGGACAACTGGAGGAGCGCCAGGAGT 

TCATTAATCTGGAGGAGGGAGAAGCCGAGCAGTTTGACAGGGAGTTTACATC 

GCGCGCTAGTCGCGGAGCGGTGCAGTCAAGACATCATGCTGGTGGCATGCAG 

GCCATCATGCCCGCCCGTCCAGCGGCACACACCTCGACGTTGACCATTGAGC 

CAGTGGAGGACGACGACGACGATGATGATGACTGTGTAATCCAGGAGCAGC 

AACCGGTTCGCTCCTCCGCGGGCCGCCATTATTCCAGTGCGCCAACGGCACC 

GCAGAACAGATATAATTACTAAATCTAAAGTCAATACAGTATATTTTACTAA 

CTATCCGATAAAACAGAAACAGAATTGCATACTATAAATTTCTGCTAATTAC 

ATTCCCAACTGCGTTCAAACGAAACGAATATCGAATCGAAATCATAGAATGC 

ACAGAGCAGCATACATCCACATCCCTATGCCGCCAATCCGAGGCGCCAACAA 

CGTGCCGTAAAACATTTTCACACGGAGGACGAAGCGGCCAGCTCCTACAAGG 

CGGTCAAGCGCGGCAAGAAGAAGTAGTAGAAACGTGATCATCTGTATGCCAA 

CATCTTCCGCATCGCACACTCAAAAACACTAGGAAGCAAAGCGTTGGGTTCT 

GTTCCATAGCAGGAAAACCAATTCAAATATTTTTTAACAAACACAATTCTTTA 

CCAGTTCTGTCTTATCCTGCGTGAGTCGACCAGAATGCAACACTAAAAAATGT 

ACAACTTCAAGATGCTATTGATGTGCACGCAGGATACAGAACAACTTGCTTA 

AATTTACTTAAAACAAATGTGACTATTCAACGCCGAAATCATTACAACACAC 

ACTCTCAGACCTAATCGAAAAATTCAATGAAAGTAATGGAATATATATGAAA 

TCGTAATTATAAGTTTGAATTATTTGATTAATTCTCAAGTTTTTAGATTTTGTT 

AGCCACTAAGCTTTAAATTATGGATGCCAGTTAGCGTGCAAATGAACACAAT 

TGATTTGAAGGCTCCGAACGATAGAAAACAACAATTACCAATTCCCCAAATA 

CATGTAATTCGTAAGGCCTAAGTAAATGTTAACGTGAATTTAATTAAATGGTA 

ATTACATTATAATAGTAAAAAAAAAAAAAAAAAA 
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hTPR2 Protein 484 amino acids 

MAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPKNA 

SYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMA 

ACRSFQRALELDHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDR 

ALEFAPACHRFKILKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLY 

YEDCIEKAVQFFVQALRMAPDHEKACIACRNAKALKAKKEDGNKAFKEGNYKL 

AYELYTEALGIDPNNIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDTYI 

KAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKTKEHKQLLKNAQLELKKSKRK 

DYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAEVQKEEEKKFKEVGE 

AFTILSDPKKKTRYDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFSFEASGPGN 
FFFQFG 



FIGURE 13B 

hTPR2 cDNA 1756 base pairs 

CGGCTGCCGCGGAGTGCGATGTGGTAATGGCGGCGACCGAGCCGGAGCTGCT 

CGACGACCAAGAGGCGAAGAGGGAAGCAGAGACTTTCAAGGAACAAGGAAA 

TGCATACTATGCCAAGAAAGATTACAATGAAGCTTATAATTATTATACAAAA 

GCCATAGATATGTGTCCTAAAAATGCTAGCTATTATGGTAATCGAGCAGCCA 

CCTTGATGATGCTTGGAAGGTTCCGGGAAGCTCTTGGAGATGCACAACAGTC 

AGTGAGGTTGGATGACAGTTTTGTCCGGGGACATCTACGAGAGGGCAAGTGC 

CACCTCTCTCTGGGGAATGCCATGGCAGCATGTCGCAGCTTCCAGAGAGCCC 

TAGAACTGGATCATAAAAATGCTCAGGCACAACAAGAGTTCAAGAATGCTAA 

TGCAGTCATGGAATATGAGAAAATAGCAGAAACAGATTTTGAGAAGCGAGA 

TTTTCGGAAGGTTGTTTTCTGCATGGACCGTGCCCTAGAATTTGCCCCTGCCT 

GCCATCGCTTCAAAATCCTCAAGGCAGAATGTTTAGCAATGCTGGGTCGTTAT 

CCGGAAGCACAGTCTGTGGCTAGTGACATTCTACGAATGGATTCCACCAATG 

CAGATGCTCTGTATGTACGAGGTCTTTGCCTTTATTACGAAGATTGTATTGAG 

AAGGCAGTTCAGTTTTTCGTACAGGCTCTCAGGATGGCTCCTGACCACGAGA 

AGGCCTGCATTGCCTGCAGAAATGCCAAAGCACTCAAAGCAAAGAAAGAAG 

ATGGGAATAAAGCATTTAAGGAAGGAAATTACAAACTAGCATATGAACTGTA 

CACAGAAGCCCTGGGGATAGACCCCAACAATATAAAAACAAATGCTAAACTC 

TACTGTAATCGGGGTACGGTTAATTCCAAGCTTAGGAAACTAGATGATGCAA 

TAGAAGACTGCACAAATGCAGTGAAGCTTGATGACACTTACATAAAAGCCTA 

CTTGAGAAGAGCTCAGTGTTACATGGACACAGAACAGTATGAAGAAGCAGTA 

CGAGACTATGAAAAAGTATACCAGACAGAGAAAACAAAAGAACACAAACAG 

CTCCTAAAAAATGCGCAGCTGGAACTGAAGAAGAGTAAGAGGAAAGATTAC 

TACAAGATTCTAGGAGTGGACAAGAATGCCTCTGAGGACGAGATCAAGAAA 

GCTTATCGGAAACGGGCCTTGATGCACCATCCAGATCGGCATAGTGGAGCCA 

GTGCTGAGGTTCAGAAGGAGGAGGAGAAGAAGTTCAAGGAAGTTGGAGAGG 

CCTTTACTATCCTCTCTGATCCCAAGAAAAAGACTCGCTATGACAGTGGACAG 

GACCTAGATGAGGAGGGCATGAATATGGGTGATTTTGATCCAAACAATATCT 

TCAAGGCATTCTTTGGCGGTCCTGGCGGCTTCAGCTTTGAAGCATCTGGTCCA 

GGGAATTTCTTTTTTCAATTTGGCTAATGAAGGGCAACCACCCAGAACCCAG 

AAAATGCAGATTCACTCAGTTTAATCTTGAATGTGGAAACAGTTCACCTCCTC 

CCTTCATCACGTCTCCGTGTGCTTAGAGCAGTTTCGTTTTCTCAGTTGGATGCC 

CTGTGTCTCTGTGAGTGGGGTGGAGCAAAGGGAACCAATGCCGAAGACCGAG 

GGCAGGGGAGGGAGGCGGGGGTGGACAGGGAGGCAGCTTGTGAATTTTTGT 

TTTACTGTTTAACTTTATTAAAAAAGAAAAAAAAAAAAAA 



FIGURE 14 A 



hMLF Protein 268 amino acids 

MFRMLNSSFEDDPFFSESILAHRENMRQMIRSFSEPFGRDLLSISDGRGRAHNRRG 

HNDGEDSLTHTDVSSFQTMDQMVSNMRNYMQKLERNFGQLSVDPNGHSFCSSS 

VMTYSKIGDEPPKVFQASTQTRRAPGGIKETRKAMRDSDSGLEKMAIGHHIHDR 

AHVIKKSKNKKTGDEEWQEFINMNESDAHAFDEEWQSEVLKYKPGRHNLGNT 

RMRSVGHENPGSRELKRREKPQQSPAIEHGRRSNVLGDKLHIKGSSVKSNKK 

FIGURE 14B 



hMLFcDNA 11 16 base pairs 

GTTATGTGTTCCCGTCCGTACTGGAGGCTAGCTCTTGTCGCGGCCGCGGCGAG 

TTAACATCGTTTTTCCAATCTGTCCGCGGCTGCCGCCACCCAAGACAGAGCCA 

GAATGTTCAGGATGCTGAACAGCAGTTTTGAGGATGACCCCTTCTTCTCTGAG 

TCCATTCTTGCACACCGAGAAAATATGCGACAGATGATAAGAAGTTTTTCTG 

AACCCTTTGGAAGAGACTTGCTCAGTATCTCTGATGGTAGAGGGAGAGCTCA 

TAATCGTAGAGGACATAATGATGGTGAAGATTCTTTGACTCATACAGATGTC 

AGCTCTTTCCAGACCATGGACCAAATGGTGTCAAATATGAGAAACTATATGC 

AGAAATTAGAAAGAAACTTCGGTCAACTTTCAGTGGATCCAAATGGACATTC 

ATTTTGTTCTTCCTCAGTTATGACTTATTCCAAAATAGGAGATGAACCGCCAA 

AGGTTTTTCAGGCCTCAACTCAAACTCGTCGAGCTCCAGGAGGAATAAAGGA 

AACCAGGAAAGCAATGAGAGATTCTGACAGTGGACTAGAAAAAATGGCTAT 

TGGTCATCATATCCATGACCGAGCTCATGTCATTAAAAAGTGAAAGAACAAG 

AAGACTGGAGATGAAGAGGTCAACCAGGAGTTCATCAATATGAATGAAAGC 

GATGCTCATGCTTTTGATGAGGAGTGGCAAAGTGAGGTTTTGAAGTACAAAC 

CAGGACGACACAATCTAGGAAACACTAGAATGAGAAGTGTTGGCCATGAGA 

ATCCTGGCTCCCGAGAACTTAAAAGAAGGGAGAAACCTCAACAAAGTCCAGC 

CATTGAACATGGAAGGAGATCAAATGTTTTGGGGGACAAACTCCACATCAAA 

GGCTCATCTGTGAAAAGGAACAAAAAATAAATAGCCATGCATTTGATTTGTT 

TAGTTTTGATTGTTTTAACAGTTAGTAATGGTGCTGGGTAATAAGGATAAGAG 

CAATCTCTTGCTGTTAAATCAGTTCTGTCCTTGGCAACTTTCTTCTGATATCTG 

AATGTTCATGAAGGTCCTAGCTTTATATTGTCCCTCTTTTAGGAATAAAATTTT 

GATTTTCAACAAAAAAAA 
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For screening for drugs or compounds that result in 
over- or underexpression of the target gene 
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FIGURE 16 



dHDJl 5' region, 24333 base pairs 

TTACGGTTTATTTACTATTACTCTAGTTAATCAAATAAACTGTATAATTCCTGG 

CTTGTACAATAATTTTGCTAACACGCCGATGCGTTCGATCTTTTTTTTTACCGC 

TCTCCGTCGTATTCATCATGGTACATATTACATCCAACATACTTTATTTTTTTT 

GGGTTATTAACATTGGCAATATCGCTGCTCGCCGCCGTTCGGTTATGCTCTAT 

AAATAAAAGGGGGGCGCCGCTAAAATTATAATAAAATTTTCATGGGTCCTAA 

ATCTAGTCTCGAAATCTATGTACAAAGTTTGCTTGCATGCTGGTTAGGCATAG 

GTTCTTAACGTATTATTGGGTTGCTTTATTTCCATTCTGCGCAGTTGTGCAGCC 

TGTTTAGTGTTTGCCTTTACGGGGTTAACATTTTTTAAAAATGAAACATTAGA 

GCGGTAACCTTGTTGTCTGATTATTGGCGTCATTAAAGCGGTATCGCCAGCAC 

GCGATTGATGCAAGGATACCGATTCAATGAAATAAAAACGAATTCAGCCAAA 

CACAATCTTTCATTTCTTTTTTTTTATCGTACTTAATGATAGCCTTAGTTTCTA 

ATGGGACTGTGTGCTTCGGTGAAGGTTGGGGATGATTTTGGGAGGCAACAAT 

TATGTTCTAGCTTATAGCTTACAGTCCTACGCCTACTCCTATTTCTAATATGTT 

CATCATCAGCAGTTAAAAAACGTTTACAAAACTCATGCGAAATTGAAATCCA 

ATAACAAATGCACACGCCGCAGTCGCATCGGCGTCATCTCTTTCTCCTGACCC 

TCGCCTATCCGCATCCAGTTAGGTTTGCTGCTGCTGCTGCGCCGACGGTTGTC 

GCCGACTGAAGCCACCGCCGGCGGACAGATGTCGTTGCAGGGCTCGCTGCTG 

CTGGAACTTGGCGCTGCCTGGTCCTCCGAAGCGGTTGAACTTGAACTTGTTGC 

GCTGCTGGAAGTTCTGGCGATAGTTCTGATTGTAGAATCGCGGAAATCCTCCA 

CCTCCGCCGTTCTTGTTCCAGCGCTTCTGGCCCTCGTACTCCTGGAATGGATT 

GTACCCGGGCGTGCTGTTGCTGGCATTGTTTCCCTTAGCCGAAGCGGACTTCA 

CCTTCCGCTGACGTCCACGATCCATCTCGTTCTCTTCGTCGTCGTCGATGTCCC 

GCTGCCGCTGCTCACGCGCATCCACCAGTAGCTACGGAAAACAGAATATCAA 

GCATTAGGCTAGAGTTCGGACCTTGTGAATGGGGAGGCTTGGCTGGCTGGCT 

GACGCATGCGCTAGTTAATGGAGCTTATGCAGATGAGTACGGTCGCTCGCGA 

ACAAGCACTGGGAATATGCACATTGTATTCGAAATGGGTGAGTGGCTTACGG 

TTCACGGTTCACTGTAACAGGTTATCAGGCAAAACGGTAACGGCACAACGGT 

TGAATTTATGGCGTATCAGGCGGTTGAAATGAAAGAAACAACGTGCCGGCCA 

GCAGTCAAATCATAAGCTTCATTGCACGGGAAAACGGATGCGGAGTCATCGG 

GTGAATTACCTAGGCTCCGGTGCAGTCACTCTCTCCCGCAATGACTTTTGCAA 

CTCTCTCTACACTTTTCACGCTCGCTGAACGGAGGACGCGTTGTGGTGAGCGC 

CCGGTTGGGAACGGATACGAGCAACGCAGCCATCACAGACTATTCGGGGTAA 

TCGTATTATTTGTATTTGTTTTGTGTGGTATGTGCTTAGTGGGGAAAAAGAAG 

AAGCGTCGCCTCTGCCGCCGACGCTTCTACCTCCTACCGGCCGTCCGTGAGAC 

GATCCGGATCGGGTGCGTCAGCGGTCGTGTCTGTTACCGCCACTGCAATTACG 

ACCACATCTTTACTGTCACTGCCACTAGTCACTGCCGCGTCGACTGCAACCGA 

GCCCTCGACGATATCGCTGCCTTCCACACTGCCGTGACCAGCTATCCGTTTCG 

CACAAACCAACTCAAAAGTCTAAATGAATGGGGATAATGTGGAAACAAATG 

CAAATTACAAACAAGTTCGTTTAGTAAATCAACTCAATCGAATTGCATTTTAT 

GCAACAGCTAAGCGAACGACATAGAAAACAAAAAAGAAGACCAAAGAGCCA 

GTTAAATAATAAAGAATTAGTTAAACCCGCAAAAAGAGAACCAATTTATGTA 

CATTTTCATCGTATTAAGCCCGCAACTTGTTATTTTTGAAGCACAGACCCAAA 

GAAAGTGTTAACCATGCATAGATTTAGTATCTACGTTAGTGACATGGTCACA 

AGGGATAGATAAGCGCTTCAAGGTGAATGCCTCTCTAAACTCACCTCCTTTTC 
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GAGCTCCGCGGGCTTGCCATTCCAACTGAGCACGGGGGAGCCGTATCCACGA 

TACGATTGCTTCAGCAGCTCATTGATGGTGCTCCCATTCGAGGTGGCATTGCT 

CTGGTAGCCATTGCCCACCCTTGGCTGCTGGTGCGACTTGAGCGCACTGCCGT 

TGAGCAACTTTTGGCGCTTGGCGCCTGGCGTTGCGGGCGAGTCCGTGGAGGG 

CTTGCTACTCGAGAAGGGATTGCGATGGTTCTTGTTCGGCGTCGGAATTTTCA 

CCGGCGATCCCTCCACCACCACGACGTCAACATCATCTTCGATGGCATCGAC 

CTCATCGTTACGCGTAACTTTCCAGATACCCGTTTTCGATTTGATGACCGCTG 

GCGAGGGTGGCGTCTTTGGCGATGGTGGTGCTTTGCTTTGAGACTGTGATTGC 

TTCTGTGGGTGCCAGCCATTCGTTAGCTGAATGCTGGGCTCCTCCTCATCGTC 

GTCCTCATCACTGTCTGCGGACTTTTTTAGGCTCTTGAATATCTCGTCTATGGC 

ATCAGTCTTTTGTTTGCTGTTGTTAACGTGAACCGATGACGAGGCTGAGCCGT 

TGGTGTGGCTGCCATTGGTCTTACTGTGACCATTGGTCTGGCCAGACTCTTGG 

TCCGACTCGCTAGAATCCTCCCCACTTGGACGCTTGCGGGGATTCGGTAGAG 

GCGCCTCCTCCTCCTCAGATGCTGATTCGTAGGGCACAAGACTTTTCAGTGGC 

GTTTTGACGGGAGTCTTTACTTGAATCTTTACAGGAGACTTTGCCTTAGGCTC 

CGTATGATTCTCCGTCATATTGGGCATGCTCGGTAGCTGGGCTGTCGTTGGTC 

TGGGCTTCATCTCATCCTCGATGTCTTCATCATCCGAGGAAATTGGCAGATAT 

TGTTGCTGGTTTATTGATTTATGGTTGCTGCTGCTGTTGCTGCTTGGGGAGGA 

ACTTTTGTTACCATTTGCAGTGGGAGCTGAAGTAGCTTCGCCTTTAGCGTGAG 

CGCCAACCAGTGGAGGCTTCGCAGTGTCCTGAAACTTTCCGGTACCCAGCTG 

GAGTCCGTTCTGTGGACTTTGCTGGTTTTGTTGCTTGAACTGGATGGCGGTTTT 

CTGGGCATTTCCATTAGTGTAACCGTTTGCTCCACCCGCCGGCAGTTGAGGAC 

CAATGAAACGCGTTGGTGAAGGCGAAGACACAGTCGCCGCCGGCACTGGCGT 

TGTGCTGTGTCCGTTGGTCAAACGCACTCCATTGGGCCTGTTGGCCGCCGGAC 

TGGCAGCCTGTGAGAGGTCCAGTTCGAAAAACATTATATAGGCATTTGTGTT 

GCACACACTGTGCATTGCGATTGGCCGCACGTAGCTGTCGTCGAAGTTGTAA 

AAGCTGCCCGTATCCGTGGAGCCAATGGCCGTGTAGTGACCGCAGTGCTGGG 

ACGCCCCCAAGTGAGTGACCATCGACACCAGGCGATAGGTGAGCGGTTGAGC 

CTGAGCTGCTTGTGAACGGGCTGCGTATTTGCTCAAATCTATGCGTGACTTGA 

AGGAAATCTGCTTGGTCAGTTTGTTGCCGATCATGGAGAATCGCTTCAGCTGT 

ATACAAAGCGTGATTGGGGCACGCTCCAAAGAGAATTGCTTTGTGGCAGATA 

CCTGCAAGCGATACGTTTAAATAAAATGAACTACAGAACAAAGGTCACAAAG 

ACCTACCTTCTTCTTGCATCCCTCGCACTTGTAGCCCATATCCTCTAGCCGTTC 

GCGAGAAAAGTGTCCCTCGAAAGCATCCTCCAAGGAGTCTGCCTTGCGGATG 

TCGAGCAACAGATCCTGGAAGTGCTGAAACGTAATGGACACATGGTTGCAGC 

TCAGACAGCGCACCTCGCTGCGCAGATAGCCGCCAAAGATCTGTCCCAGCGG 

CGTGGTCTCCTTAACCAACTGATCCAGCTCTTTGTAGTTACGAAACCGCATCA 

AATACGCCCGCTCCATGGCCTCGACCAGGAAGCGCAGGAACTCGTGCGCATC 

CTCTTGGCGACCAACGACCATGTGTTTGCAGATCTGCTTTAGCTTCGAGTAGA 

TGAGGAAGGGTCTGACGGCCGACTGATTGCTTTGGGTGGCCAAAAGTGTTTT 

GGTCATGGCGCAAATGATGCAACCGCTGCCAGGTTCGGCCACATTGCAGTCA 

GCCAGATGCGCCTGCTCCGAAACGAGCCAATTGGCCAGGGCGGGTATGTGCA 

GGAGCGCCTGAAGCGTTGAGTTGAGGTAGCAGGTGTTGCCCACATTGATCAT 

GCCCGTGCCCACCTGCCATTTGCGCTCCGACTGCTTCCAGCCAATGCGTATGT 

TCTCCCGCGGATAGAGGACCCTCTTCGGCTTGGGCAGCTCATTGGGATTGCTT 

GTCGGATGCTGATGATTGTGGTGGTTGATGTGGTGCGACTGATTGTTCGGGTG 
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CTCCGCTTGCTTGCGGGCGCCGTTATTGTCTGCAAAGGTAAAGAGGACGGTA 

GACAGTTTAAGCACGTGCCACAGGAGAAGGCAGCAGGGAGACAGGAACAGC 

TTGTAGAGCAGCCACAGGGCGAACCCGTCCACCATTATCACAGTCATAATGC 

ATTTATTGGAAGAATTCCCTTCTGCAGATTAAGTCACTTGATCCGCGCTGCTA 

TGAAATATAAATAAAACGAGCAGTGCTCGCTGTGGAAACTGCTGACACACAA 

TCGCGCTTCCATCACCTGTTCGCAGTGTTGGAAAGGGTACACATTTGTTGTAC 

CTAGGCACCGGACTGTGCAGCATTAAGATAGCTATTCTATTGAACAAAGAAA 

CTTGAACACAAAGTATACGCCGAAAAAAATTTCCAGTACTAGATTTTGAAAT 

ACAATTCTTTGAACATCGTTACAGAATGTGATATCACCAGATTTTATCTGAAA 

ATATTTTCACAGCATCGTAATTTCATATGTACCCTGAATATGTATCTTGCAGTT 

TTGTTTGGGAAAGTGTACCAATCGAGGTACTTATCCTGGTACACATATCTCAG 

ATATTACCCAGCACTATTGTATCTTTGATAACAGCTAGCGTGTGAGCGGGATG 

GCGACTGGCAGAAGAAGAAATTTAAACTGATAACAGCAAGCGAATGAGAGG 

GATGGCGAGTGGTGAAGCAGTCCAAGTGTCTGCTGCCGACGAATACAGTGGT 

CTCGTTCTGGCGTAGGGGGTTGGGGCGGCAGTGTTGCCAACTGAATTTTTGGC 

GCGACCTAACAGTGGTTGTTGTAGGCCCAATGCTCCCCCCTTTTATTGTCTTT 

GTAACTGTGTTCGAGGCATTGACCAGGCCAAAAAAAAGAAAAGAAAAGAAA 

AGTCGAAACATCGTGTAACAGCTCCTGGTGCTGAGCTTTGTGTCCACTTCCTG 

CTCTGTGTGAATCACTTCTGCGAGTCTGGCCTTTGTTTGTGCTCTTTTTATCAC 

GCAAAAGCAGATTGCGGCGCATTTACCGCATCTAAAAAAATAAAGCAAAGCC 

AATAAAAGCACCGCTGGGGCTGGCCATGTGCGGGGGAAAGAGACGGAACTA 

CGGAGGGGAGCCCTCGTGCTTTTTGTCTTTTTTTCCTTCTTTCATTTGCCGCTG 

GAAATACAGCACGTTTTTTTCCGCCACAACTTCTGTGAATCAGAAGTTTGGAA 

GAGGCGGCTCTGTTGTTGCTGCTGCTGCTGTCACTTTTCCAGCTTACTCTTtAC 

GGCGTTGTACTTGTTTTGCTTTTTCCGCGTATTCCTTTGCATTCTGTTTACACG 

TACACCACCCAAAAACGCCGTCACACACGGACACACACGCACGCACTCACAT 

ACAGAAGCGCCTAAAAAGTACAGGTATGCTGCGCTGCCGACGTCGACTGCAC 

TGCCGACAAAATGCAGGCGGAGCAATAAAAAAAATATGTTTGCGGAAAAAC 

ATCACACGTGATTTGTGGAGGGATATTCCCAAAAGATTTGGCAAAAACAAAC 

GGGACGATACATAAATACATTTAAGTATATATACATCTTATATATATAATATG 

AAGATATAGCACATGGAAAATTGCGCAAAAATTGCCACACAAAGAAGAAAC 

ACAGACGAAGGCGCAGACGGAAAAAGCCACTTTTGCAAGCAACTTTTGGATT 

TTACATTTTGTTGTATCTTTACACAGTGTACTCACCATGTCCATTTGCGCCCAC 

AAGTTTGCCTGTATTGTTTTTGCCACTAAAGCCATTGATGGCGCCTGGATTTC 

CCGGCTTGATGACAATATATTTGGACTTGAGGTTCTCCAGCACCGATTCGTGG 

TAGTTGGGCACCTCCTCGTATTCGATTTTGGCCATCAGGATGCGTTTGGCATT 

GGCCACGATGTGATTTTGCAGGCTGCCiGTTGGTGTCCTCGCCGCTCTTGGCCT 

GGTCAGTCGACGAGCCGGCGGAGGAGTTGCCGCCAAGGGATTCGCGCAGCG 

CTGCGTTGACGACATTCGCCGTTTCGCATACGGCCATCGAAACGGGCATGGC 

GATGCTCCGGCTGGGGATTTGCGGTGGAATTTTGAACGGGTGTGAGGGGCGT 

GGTGTGGCGTGTGTTGGTGGTTTTCGCCACCCAGTTAGCTAATGCACATGGGC 

GTGCGATCCAAAGCAGATACTAGAGATCCTTCTGCACAGCCCACACGTCCTT 

CAAAACTCTCCTACTGCTCTACGCTCACTTTTCTCCTCGCCCCTCTCTCGAACA 

CTTCTTGTTTCACACACCGACTGCGACACCGACACACGCACACTAACGCACTC 

GGGAGCACTCTTCTTTTTCTGGCTTTTTCGCGCTGCGATCTCGATCTGTTGGCC 

TACTGAGCATTACGATTAAGAAACGTTCGCTCACAAATTGATCTGTTTCAATT 
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TCGTGCGCGGCCAGGCATTTTAGAACGAAAAGTCTGCTTTCGAAAATAATGG 

CAATTCCTTCCCTCGTGTTTCTTCCGACTGCGGATTCTCTTTTCGCTTCATTTTC 

GTCATTTGGGGATGCCAACTCGCGAGTGGCCAAGTGACGCGATAGGCCTCTC 

GAAATGTCCTAAAGCATTTCACGATATTTACAAAAATGTATTTCGATGTTTTC 

TTAACAATAAAAAATTGGTTTAAATTTAATAAGACATTTGTTACCTTGAATAT 

GTAAGCAATATCTTATTGAAAGGCTTGCAGCGACATTTTTTATTTATGCCTAC 

TATTCAAGTTATAAATTTAATTTTTATAACGGTATTTTTACACCTTATCAGCAC 

ATATCGATAAGTGTGATTGGGAACGACAACCCATCGGCACAATGTTGATGCA 

ATTGTTGAGCTAGCCTTCATAATTAGTCGCAATCAATCGAGCAGAATGGCTTC 

ATCCACAGGTCTCCTGGTGGTGTCCAACATCAAGCACGTTGGCAAATCCCTGC 

GAGCCATCGAGAAGTACGTGAATTCACTGTACATCCACCTAAATGTGGCGGG 

GTCAACGTCCACGACGTCACCAGTTCCACCGCCTCCGGTTTGGGGTCGTCTAA 

TCTCGCAGCTGTACGCCAACAGCAGCAGCTATGTGGGCAAGCAGTTGGACCT 

TCGCGTCCTTGTCTCTCCCCTACGACCAGGTGCCAATGGATCCCTGAAGTTGC 

GCCAGCCCGTCGACCTAATCTTCTCGGATGCACATCATCCGGAGCTGTGCGAC 

AGGCTTCGCGCGGATCTTAACATCAGCAAGCCAACAATCTTCCTGGATGACT 

CGGTCATCTCGGATTTAAGTGCCCAGCAGGATGACACCCAGCCGCCTAAGGT 

GTATCCCTCGGTTGTCCTGGGCGGAACATTCGATCGCATCCATCTGGGACACA 

AGATATTCCTCACCCAGGCTGTACTGCGCACCTGCAAGCGTTTGGTTGTGGGC 

GTAACCACCTCCGCCATGACGAAGGGTAAGACGGGCATGAATTGGCAAAATA 

AAACGCTTATCTTAACGACCATTCTTATCGCTGTCTGCAGGAAAGACGCTGCC 

GGACTTGATTTTGCCCGTGGAAGAGCGCATCGCCCGGCTAAGGGAGTTCCTG 

GTGGACATAGATGATACGCTGCAGTACGAAATTGTGCCCATCGATGATCCCT 

TTGGTCCCACGCAAGTGGATCCTGACCTGGACATGATTGTGGTCAGTGCGGA 

GACGTTGCGAGGAGGGCAGAAGGTCAACGAGGTACGCTCCGCTAAGCAACT 

GCGCGAGCTGGAGATCTTTGTGATTGACATTGTTGAAAGCAACGTGCATGAT 

GGCATCCACGAGACCAAGGTCAGCTCGAGTAACACACGCATCGATCTGCTGG 

GAACCCGCTGGAGAAGGCCGGAGCCACGACCACAGCTCCCGCCGCGCCCTTA 

CATTATTGGACTCACTGGCGGCATCGCATCTGGCAAGAGCAAGATGGGCGAG 

AGATTGGCCAACATGGGCGCCCACGTGATCGACTGCGATAAGGTGGCGCACG 

ATGTTTACGAACCTGGTCAGTTGTGCTACACCCGAATTGTGCAGCACTTCGGA 

CAGGGTATTGTTTCAGACGATGGTCGCATCGATCGGTCCAAGCTGGGACCCTT 

GGTGTTTGCCGATCCCAAGCAGTTGCAAGCACTCAACGGCATTGTCTGGCCG 

GAACTTATTGCGGAGGTTAACAGGCGGCTGGATGCACTGCGTTCCCAGGCGG 

ACGTGCCGCGTGTGGTGGTCCTGGAGGCAGCGGTGCTGCTGCGAGCGGGCTG 

GGAGACCAATTGCCATGAGGTGTGGTCCATGATTGTGCCACCGGATGAGGCT 

GTGCGGCGGATTATTGAGCGCAACAAGTTGAGCGAAGTGGAGGCCCAAAAG 

CGACTGGCCAGTCAGGTGCCCAATTCTGAGATCGTGGCCAAGTCGCATGTGA 

TATTCAGTTCGCAATGGGATCACGAATTCACCCAGAAACAGGCGGAGCGTGC 

GTGGAAAATGCTTACCAAGGAACTGGACTCTTACCAGAGCAGCCTTTAACCC 

GATGGATATTTAGATTATCTTGTTGATCCTTATTTTGTATGATTTTTTATGCAT 

TTGTTGTATATTGTTTAGTTGTAAGTCCAAAGTTGAAAAGAAATGCTGGGACG 

TCATTGGGGAAAAACGCTGAAAATTTCAATGGAACCTTAGTGGCTCTCGCCC 

TTCTTGCCAGCCACTCGCTTGAAGTCGTTCATCTTGGTGGTCATGATGGGGGA 

ACCGATGAAGCCGATATAATCAATCTGCGTCACATCGCCACCTGACTGATTGT 

TCTTCACGAAGATTTGGATGTTTTGCACATTCTGGAACTTGACGTAGCGCAGA 
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TTCACGGGCACTCCACTCTCCAGCTCCTTCTGAGCCAGGCTGCAAAATGGATT 

GAACAGTGAGAAGAGCTAAGCAGCCATAGAGAAGGCAATAGCTACCTTAGA 

TCCTGCACACTGTTCATGGACTCGGCCATGTCAAAGTCAATCGTGCGGGGCTG 

GTTAATGAACAGCTTCACATCCTTGGGACCCAGGTGCGAAGGTGCCTTGAAC 

TTCAAAGAGTGGATCTTCACAGCCTGATTAAAGGTGATGGACAGGATGAGCT 

GCTCATCGCAATCGGACTGCAGGTAGCCACCGGCGGAGGCCAGGGCGTGCTT 

TAAGTTGTGGTCATCAGCTTCGTTGAGGCACTCGCACTCCTGCTTCGAAATAA 

ATGTATTCAGTTCCATCTGTAAGAAGGATTAGGGATTATTTTTGGAACATTTC 

CAAATACTGCACTATATTACCAATCCCTGCCCGTAATCCTCGCCCCCCTCCTC 

GCCACCGGATGTACCGATGTGCTGCTGGATCTTGGCCTCGAGCCCATTGACGT 

CCGCACCCTGGACGCGATCGATCTTGGTCCTGTTCCTGTAGAAGATGAATGTT 

GGCATGGCCGAAACGCCCTGTCCAGCAGCCGTGTCCTGGCACTTGTCCACAT 

CCACTTTCAGGAAGATGGCCTTTGGGTACTTTGTTGGAAACGTCTCGAAGATG 

GGCGCAATCCGCTTGCAGGGACCACACCACGAAGCTGTGAAGTCCACCACAA 

CCAATTGAATGCCCGCTTGGGCCAACTCCGCCTGGAAGTGGGACTCGTCGTT 

GATCACGCGCACGGACATGGTGATAGGATTAGGTTTCTATTAATTGAGCTTTT 

GTTTCGGCAGCCGAATTGGATTTAAGCAAGTAAATGTTATTATTAACGTTCAA 

TGCAAATTTTTTTTGTTAAAGATGACTTGTAATATGCATTTAGTCCAAATTCGT 

GCTAAGAAAAATACCGAATGCGGTATTCCACAAGCGGTCACACTGTGATGGT 

ATCGATATTTCGAGCTCTTTGACTTCCTATTTTTAGAGGGACCATTTATGTGTA 

ATAGAAAAAAACCGAAACTTAATATTTAAACTTTTATTGAAATATTAGTGGA 

TTACAATATGTAAAACTATGAAATATTCTCATTTGATATAGCTCAAAGTGTTA 

TTTAAAATTCATTCAGTGTTTACGACTAGCAATCTACGCTTTCACGCTCATCTT 

AAGCTTACCGCCCATTTGCCAGGGTTGTCAAGGCGAATGAGCGGTCCCACCA 

TACACGCCACTGGAACTTTCGATACCTGCGCTGCGCCTGGCCACACGTTCATT 

ACCTCGTGGTGTTTCAGTCGGTCGCATTTTCATTAAGTCGCCATTTTAAAATT 

ATTAGAGTCAAGTACAATGGCAGATGTGGAAAAGGAGCCCGAGAAGACCAT 

CGCCGAGGATTTGGTGGTGACCAAGTATAAGTTGGCCGGCGAAATCGTCAAC 

AGTGAGTATTCCTTGGCCGGAAACAGCGAACGCTGGCCGATTCCTGGAGTCG 

CTGCTACGTGGCGCTTACACAATGCACCGAATGCCGCTTTCCCTTGTGCGCCA 

CGCGTTGGTTAATCTGCCTATTTCTGGACTCTGTCTGCTCGTTTAATTTTAGAA 

ACCCTCAAGGCGGTAATTGGACTCTGCGTGGTTGATGCCTCCGTCCGGGAGA 

TCTGCACCCAGGGCGACAATCAGCTCACCGAGGAGACCGGCAAAGTAAGTG 

GTGGCCACCTGGCGGTCATTCGCGCCAATTTCATGTCCAATGATTAAGACTTA 

CACCTTTGAGGGTTTCCCGATGGCGAGCCATGTGCTGTGCGGGCTGGGGATC 

ACCTCGTGGTCGCCAGGCGCACGCGGGGACTCCAATGCTCCACGTGCCCGGC 

TTGTGTGCTCTCCAAAAGGTCCCGAGGATTTACAGATTATGAGATCTGAGGA 

CACACCGCGCACTATCATTGATATATAGTACAACGAACAAGCAATCTAATGC 

TTTTATCGATCTTTCACAAACAGGTATACAAAAAGGAGAAAGACCTGAAGAA 

GGGCATTGCCTTTCCCACCTGTCTGTCCGTCAACAACTGTGTCTGCGACTTCT 

cgccagccaaaaacgatgctgactacacgttaaaggccggtgatgtggtcaa 

AATGTAAGTTGAACCTCCTATTCCACATATACCGCCACTAAATACGTAACATT 

TCTTTTCTACAGCGATCTGGGTGCCCACATTGATGGTTTCATTGCCGTGGCCG 

CTCACACAATTGTGGTAGGCGCTGCTGCGGATCAGAAGATCAGTGGTCGCCA 

GGCCGATGTCATCCTCGCCGCCTACTGGGCTGTCCAGGCTGCCTTACGTCTGC 

TCAAGTCCGGCGCCAATGTGAGTCCTCCCTTACTTCTAGGTAATCCTCCGTTA 
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ATCCCTGCAAGAAACGGATTGTCTGCCGCGATTCTCCAGCGACTGAACATCTC 

AACACTTGCAAAGATCAGCTGTGGCAGCTGGTAATTGCCCTGGCCTATTATTC 

AGGACTGGAGGCTTCTTGTCAGTTGTCCACAAGGTTATTTCTTCTGCAGGCAA 

CGGATTGACTGCGCTCAAACTCTGACACAGATCAGCTCAACACCTGCGGATA 

GAAACTGTGTCAATTTCGTGAACTGAACAAGTTCATTCCATAGAAGTGTTCGG 

TCTTTAAATTTGTCCACATCTCCAGTTTATAGATATGTCGGAATTGTAATCTGC 

AGGCAACGGATTGTCTGCTGCCTTAACTCGTGGCTCAGCACAGCTCAACGTCT 

GCAGAGATCAACAGTGTCGATTTCGTGAACTGAACAAGTTTAGATACTTGAA 

ATGTTCGGTCTTTAAAGTTGTCCACAATCGCAATGATAATGCCGATCAGTTAT 

TGTTATTTTGCGTTATCTATAGTATACTATGATATTTGATTAAGATTAGTCAAA 

GGGAATTGGAATGTTTTCTTTATCTCTGCTTTGAACTATTTCCATTTTATTTCA 

TACTTAATATTTATGTTTCAATTCTGTATCCTTACAGAACTACTCCCTCACCGA 

TGCAGTGCAACAAATCAGCGAGTCGTATAAGTGCAAGCCCATTGAGGGCATG 

CTCAGTCACGAGCTGAAGCAGTTCAAAATTGACGGCGAGAAGACGATCATAC 

AGAACCCCAGCGAGGCGCAGCGCAAGGAGCATGAGAAGTGCACCTTCGAAA 

CGTACGAGGTGTATGCCATCGATGTTATCGTCAGTACCGGCGAAGGAGTGGT 

TAGTAATCCATCAATAGACACTACATCTCCACTAATTTGTTCGATGATTAAAA 

ACACGCGCTTGAGGCTGACTTTGCTGGAATGCGGTGTTTGTTGCGAGAGTGA 

CTTGTTTGCTCGGCGTTTTTTTATACTAAAATGCGGCACGTGCAGACACCAAG 

TTCCGGCTGGCTGTTGTCCGAAGATTGCAAGATTATGAGATCTGAGAACGCC 

AAATTTAAGCTGGATCCTGGATCATCGCAGCCAGAGCATTATTGCTAACATTA 

TTCGTATTCGTTGCAGGGACGCGAAAAGGACACCAAGGTCTCAATTTACAAG 

AAGTCTGAGGAGAACTACATGCTCAAGATGAAGGCGTCCCGTGCTCTGCTGG 

CAGAGGTGAAAACCAAGTACGGAAACATGCCATTCAACATCCGCAGCTTCGA 

GGAGGAGACCAAGGCCCGCATGGGAGTTGTTGAGTGCGTCGGCCACAAGAT 

GATTGAGCCCTTCCAAGTGCTGTACGAGAAGCCATGTAAGTGTGATGCATAT 

TATTATTAATCCTATTCCCTATTATGCGAGTTGGCAGAACTTAATTCCGGACC 

TGGTACACCTTCGGGTGCTAAGTGCGGCCAGACATTTTGCCAGAACAAATTC 

CAGGCATTGTCGTCTTCAGCAGTTGCCTCAGTGTGGCCTCTGTCTGAACATGG 

CACTGTCACAATCGTATCCAATCTATTAACCTGTTTTCTTATACTTATTAAAGT 

TAATTTAGAGACTAAACTAGTTTGAGCAACCTTTATAAAGTTCGAATTTTAGC 

CGGAAGTAATAGCAAAGTTAAACAATCCTTTTCCTTATCTTGCATTACAGCCG 

AGATTGTGGCGCAGTTTAAGCACACGGTTCTGCTCATGCCTAACGGCGTCAA 

CTTGGTCACCGGCATCCCATTCGAGGCGGAGAACTATGTGAGCGAGTACAGT 

GTTGCGCAGGAGGAGCTCAAGGTAAGCTGCAACAATTTCCTTGTATTCACGA 

TGCGTACTCAATGAAATCTCAACTTTTTGCAGACTCTGCTCGCGCAGCCTTTG 

GGTCCTGTGAAGGGCAAGGGTAAGGGCAAGAAGGCAACAGCTGGGGCGGCG 

ACAAAGGTGGAAACGGCGCCGGCCGTGGAGACCAAGGCATAGACCAGCCCG 

CTGATGATGATCCGCACCGCCAAGCCATCAACGGAAACACAATGTGAACAAT 

TGCGCTGCCCAACGCTGCGCTCCACAGATTTTTACTATCGAATTCGTTGCGTA 

TTAGAGGACCCTTTTGACAACAGAACAGGACAGAAGAGAAGACGGCAACAA 

TTTGAGGATACATTTCCCCAGAAATCCTCCATCCATCAACAAGGCGGGCGGT 

CGGTCGGTCCCGCGCCAACTTTACCTCTTTATTTCCTTTACTATAAGCTGCCTT 

CGTTTATCGGTCTGTTCAACATCATCGCAACGAAAAAGCAAAGCAAGAACTG 

TCATCAAATTGTAACAATTTTAACGCTAAATGATCTTAAAATATAATTCAAGT 

GAAACGTTATTAACGCTGCGTAGTAGGTATTAAATAAAATTAACATTTTCTAT 



FIGURE 16 (Continued) 



AAAACAGCCGATAAATGCCAAACGATTTTTCATTTATTTACTTTCCGCTGGCG 

CCCAATTTTAATTCGATTTCGATACGCTTCTCATTCTAATAAATGCACTTGCG 

AGTTGTGTTTATTTTATACGTTTAATTTAGTTTTGATGTTCACATTCACATTAT 

ACAATTTGTAATTTAGATTTCTTGCCTTTTGTTATTTTAAATTTTACAGTCTCA 

TCTTTGAACTCTTGTATTACGAAAGTTGCAAGAATAACTTCGTTATGTTAAAC 

GTCACTTAGTGCTGTGCTCACTTGGCCACCCCAGTTGTCCATCCCAGATCCAA 

TCCCAACAAGACCAGACCAATTCGATGCCGTATACGGCGACTTTGCCCAACT 

CGCTGACCTCTTCCCTTGCGTCAAACAAAATAAAGAACAACAAAAAACGCAA 

TTGCTGCGGATGAAGTATAGAAAACACGAGCAGCACTTGCAGACGACAAAG 

ATATGTGGCCGGTGATCAAAAGAGGATCTGGGATTTAATGGTCTGCCGTCGC 

TTACATACATGGTTTGGTGTACTTTTTTTTTTTTTGTTATGATCGCCGCGACTG 

TTTTCTACTCGCCAGACTAATTATTGACATGCACGTCCATCGGTGCGGAGGCG 

GTCACGTTGCTCGACTTCTCCGGAGAGTCCAGGTAAATCTTCAAGGCACGTTC 

CCGGCGCTGCGCATACCGCGTGGTGGACACGCAGCCCACCCGATCCAGTCGT 

GCCTTCTCCCTGGCGTTCATCAGGCGTCGCTCCTCCAGCGTCAGCTCTCGCGC 

AGGTACCGTCCTATCTCTGTTGAATTCATTGGTTAGTCTAGGAACTGAACTGC 

CACTTGCTCCACGCTTACTTGTACAGGTAGATGTTTCCTGTCTGTGTGCTATTA 

AGCGGATATTTGTCCAGGGTGGTGGGCACGGAGTACCAACTGCTGGACTCGT 

TAACACTTAACGCTGATATGCTTGTGCAGGGGAAGTTGCTGTTCAACTGCAG 

AGAAGACCAATTAGATCAATATACACAGTAGAACGCAATTTTACGAACCTTC 

ATATAGCTCAGTTTGTCTATTGGGATGTCGCTGATCTGTGATAATGAAAGTCT 

GATTTCGCTGTCCTCTGCAGAAGATACGAGTTCGATTTACTGCTTACAGGGCA 

ATATACAGATTTAACTTACGGTCCAAAGTGATTTCTTGGAACTTTCCAAACTC 

CAGTTTAGCCGGACACCACCGTCTTACAAATAAAGTCAGAGAATCGTCCTTG 

GGCTGCGGGTCCACTTCCTCCTCGCACTCTTGCACAATGAACTGAAATGGTGT 

GATATAAAATCCAAGTTAAGTTTTTTTCTCATCACAGAGACAGGGGAACCCA 

CCTCCGCTGCGATGCTGGAGCGCATGTCGCAGTAGAGCGTTTCATCGTCGGA 

CAGAATTTTGATGGGAATACGTCCGCCCTTCAGCCAGATGCGACAGTTTTGCA 

CAGACAGTGTGGCGTACTTGGCGTCTATGCGATGCAGCTTGGCCACCAGTTCC 

TTCTTGGCCTGCTCCGCCGTGGTGTTGGCATTGTAGACCCACTCGCATACGCA 

AGGCAGTTTGGACGTCTCGTTATCAATGTCCGCCAGTCGCAGGAAGTGAATC 

TTGGCCTTGAACTGGTCCGGCTCCAGCGTCTTACCCAACTCCACCGTCAGTGT 

TTCGCCTTCTATGAGATGTACCAACGAGTTGTTCTGGTTGTTCGACAGATTAT 

TATCGTGTTTCCGCTGCAGTTTAAAGTGGGCGGCGGGCACTTGGATCAGCTGC 

TCGATGTGTTTCTTAAAGGCCCCCATTCGCATGTGTGTGCCCACCAGCAGCTT 

ATAGGCACGCGTGGGCTTACGCAGCTGAGCCTCTTCGTCGGACTGGTGACCA 

GAACTGGAACCGGTGCCAACCACATCCACGCACTCGACCTTTGTGGCGTAGA 

AGAAGTGGTTGGTGCTGGTGGGCAGGAGCAGCGGATCCACCACATCGGCCGC 

CGCATAACTGCCGTTTCCATTGCAGTAGGCATGAACGCGCATCATGGCATCGT 

GTGAGGCTGCTTCGTCCTCCGGACTAGACAGCTGCGGCGAGTGACTGGTGGA 

ACTGACTTGGCTGTCGCCTCCACCGCGATGTGCCATGTTGTCCGTCTCCACCA 

GAGTTCGGTCTCCGTCGCTCAGACTACTGTCCTCCGAGTTGGACTCGTGGCCG 

TGGCTGGGACTGGGTTGTGACATGGGTTCCACTAGATCTCGCTTGTAGCGCCT 

CCAATCGTAGTCGTTGCTGGATGACATGTGACCAGGTGCCACGCCATTCATCA 

TTGCAGCATCCACAACATCACCCCCGCTGGCGCACTGAAAGGACAGCCAAAA 

ATTAACCTTAGTTATAAACCCAACAGCTGTATAACCTACCTCGACTGACTCAA 
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CGGTGGGCACACCGAGCATCTCAAGGGTCGCGGCATCCGTGTTGGGTACGTT 

CAAGTAGAAGTATGTTATGGACTTAAACTGGGTGTTGGCCATGTTCTGGAGA 

TGTTGCAGGGCCTCCGGCGTCGGATGCGGATCGTAGGACACGAAAGCCTTCG 

GCACCGTGGCGCGCACCGTGGCGAGTAGGAACTGCTGCTCGCTAATGTGCAA 

ACGGAGGGCGATCGAGCGGCGAAGGACGTCGCTGGCTTCCCGCTCCCGCGCT 

GCTGAGTAAACCAGGAAGGGACCGTCCATCGCCATGGTCGATAGATCTACCT 

TAAACACATACCAAGTAATGCCGTTCGGCGGATAGACCTCAAACTCTTGGTC 

CTCGGCCCGGTACTCCAGCAGGAAGTCGAGACTGTAGTTCTGCGCCGCGCGC 

AGTTCGGTCAAAGCTGGGTCTGTGCAACTCTCCAGGGACTGAATAATCGTGT 

CCATCGAGGAGTTGTAAGCCACCAAACGACAGCGGGAAAGCGGGGCGAATT 

GTTCCACATTCAACATCTCATAGGCCGACATGAGGACGAAGTTGATGTTGAA 

ACTCTGCGATACGTACACTCTGGTAATCTTCATCTTCTTCAGGGAGGGATTAT 

AGAAGTAGACGCGCGGCTTGTACAAATCCGGAAGAGCCAAGTCCGTAACCGT 

GATATGCCTGCCCAGGCGGGACACGCGCGTCTCCTCCTCCGAGTGCAGCTTTG 

GTAGCAGCGTTTTGATGTGCTCAGGAAAGTCGGCCACCTTGGCGACTAGCTC 

GTTTCTTTTGGCATCCACCTGCCGGTACATCAACATGTATGCATTGGTGCTGG 

AGGTGTAGGCACTGGAGTAGTAGCTCCCGTTGGGTCCGCCAAACGAACGTTG 

GATGTCCTCTTGGGTGATCTATGGATAGATAGTCGTTCAATATTTTCTCAAGT 

TATGAATGTGTTGCGAAAACCTACACTAGTCACGTTCTGATCGTTAAAACAG 

AACCACTCGTTGTTGTCGAAGTCCTTAATATAAGCATAGTAGTGTCCGCCCGA 

AGCGCTGCCTGAATGAATCATGATGGCGAACAGTTCGTAGAGATACGGACCG 

GATCCTTGCTTGGCGCTCTTGCTGGTGCTGCTGCTCATGTCGATGCCTTCATCC 

TCGTCGTTCAGATCGTTTTCGTGCTGACTAGAGCTCGCTGTGGTCACCACGCC 

GCTGCTCAAATTATCGTCCTCCATGGCGGATCCACTATCCGCCGTGCTGCAAT 

CGTCCACGGTGCCGTTGAGCTGAGAGTTTTGCTCACCGCTGTTTCCACTTCGG 

TTAATGAACGTGTTCAGGTTGAGCGTCTGAGGGAAGGTCACTCTGAAATAGA 

GGGGGAGCATGGAATTAAATGCTTATGGATTATGGCAAAGAGACTAACCTGT 

CGTTTAATTTGATGCGGTGCATGGTCTGGTAGTCAAAGTCAAAGCGTTTAAGG 

TGCAGCGTGAGGATGTAGGGAAAGGACTTAAAGTGCAGTCCCTTGTGGGCGT 

CGCATTTTTTCTTGCACTTCTCGCACAGATACTGGTTATTGCCATCGAGTGTTT 

CGGGCTGAACGAAGGCACGCAGAGCTTCCTCGATGCTGCCGTATGCGGAGCT 

GCTTCCAAAGGGCCTCACAGGGAGCGGGATATCTAGAAAGGTGTCCTCGCGC 

GTCTTCTCGGTATTGCACTCCAAACACTTGACATAATCATTCATCTTGCCCTC 

GTACAGATTAGAGATGAGATTTGCCTGCTTAGTGTTCTTGAATTTGTGCTCCA 

GAGCGTCGAACATAACTCGGCACAGTTCCTGGATATCGTGCTGCTGCCATGC 

CTCCGTCGAGTCCCACCCAAAGCTGCGAGTCAGGTCTGTGGTTTCTACCGCCG 

CTTTGGGCGAGGTCTGCAAGTTGAGGAAGAGCTTTTGCAGTTGGTATGGTAT 

GTTCTTGGCCTCGTTGTCATTGTCGAACTCCCAGCGGTACAGAGCATTTCTGA 

ACTCGGGTGTCATAAAGAGTGCCTGCAGCAAGCTGTTTAGATAGCAGGTCAT 

GGCTTGGTTGACCAAACCAACATATCCCCTGGGACCCAAGCTCGCCTGCCTTG 

CCTCTGCCTCAGTTTCTGTGGTGGCCGAGGACACGAAGTCCGCACCCGTTGTG 

TTGACTCTCTGCCATGCTCGCAACTCATCGCCTCCATACTTGCGACGATAGAA 

GTTTGACAGAGCCGGGTACGTACCATCGTCTGTTCCAATTGTGGATGGGTCTG 

TCACTCCGGTCACACCCTCGACGTCCGAGTCACCGGTCGGAGCTCCGTAATC 

GTATCCAGGTCCCAGCATTGTCGGACTAGCAGATGCTCCGAGTGCCAGGTCG 

TCATCCGACAATTGTTCAGCATCTGAGATGAAAAGATCTGAAGGAGCATCGT 
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CCACCGGTGAGATTAGGTTTCTACCTTGTGGATACAGCTGCAGTTGCTCAGAG 

AGCTGTTGCTTGCTCAGGGTGTCTACTGGTTCGCAGTCCAACTCTTTGATGGG 

TGATGACGGCTTGATTGGACTCAGCAACTCTAAAGTCGGCTTAGAAGTGACC 

TTGGCCGTTTTCTTGGCCACAGGACTCTCCGAAGAAGTGTTTTTAGAGTTTAT 

CTCTGTAGACAGCTCGGGACATTCTTCAGGGCTAGCCCTGGGAGCTTTTTCCG 

AACCGGGCTTGGAGATCTTTGCTGCAGTCGTCTTGATCTTGGAAGTCTTTTCG 

GGACTTGATTCCGAACTGATGCTGGTCTTAGCCAAAGAGTCCTCACTCGTCGT 

CTTGGCCTTGCTTGGAGAGGAAGATCCCGAAGCTGGCTTCTTTTTCGTTTTCT 

CACCAACTACGCGTTTCTTCTTCTCGCCGGTGGCAGGACTCTTGGCCTTCTCG 

CCGTCCGACTTCATAACCTTTTTGACTACCACTCTTTTAATGGGTAACTCAAA 

GCGTTTGGTCACGTCACCATCCCAACTGCCGGAGGGCAGCAGGATCAAGTGA 

TTCTTCAGCTGGGGCTCAAAACCAGCCACTTCGTACATCAGCTGAGATTCCAG 

GGCATtCAGATTGACCTAAAGTAAAGGGGAATTCAATTAGCGGTTTATTAGA 

ACCTCAAGATGTGCAGATATTTTTACCAGATCCTTGTTATCGTGTGGCTGCAG 

CAACAGCTCGAACTTTTCGTACGAGAACTGCGTGCCAATAAGGTCAATCACG 

CGTTTCACCGTGAAGTGGGAGCGGACCACTACGTTGATCTTCTTTTGCTCCGA 

GCCGGGTGTCTGGTCAAAGACCGAGACGGTGCACTGCTCGCTCTCCTTGTCCG 

TCATGTCCAGCCCGCGGAATAATCAAGTGATGGTGGAGAAAACCCTGCAAAA 

AGATTGTAGGCGAAACGTTGGCTTTACTTATGAATTTTGTCTGGAGTTTTCTTT 

TTATTTTTTTTTATTTCTTTATTTTAGAATTAAAAAGGTGACACGACACCTTTG 

ACGTTTTCGGCGGGGCCAAGTTCCTGGACATGACGATGCTTCTTGGCCCATAG 

TAAATAAGGAAGAGATGCCCAGCCCCAAATTACTGCGAAATCTTCTTGTTTTC 

GACCCCATTCGCGAATAAAGCGGCAGAAACCAAGAAGATTCCGTCCCACCTC 

CCGCAGCCGCAGATATTGACGTGCTCCGGGTTTGCTTTTCGCGCCTTATTTGT 

ACGGGCCAGCACCAGTTGCCGTATACATATATATATATATATATAGATAGAT 

ATACACATATAGCACGTACACCCAATCGAGCATCGACTGCCCCCCGAAATCG 

ACGTCGTGACTAACGCGCAGGGGAATTTCGTAAACAACCGGCCATCAGAGTT 

GCCTCCGGAGGATGCTACGGGAATTATTATTTGCCTCCAATGGACTACCAAC 

GTCATCATCATCATCATGACCATAGCTATCACCATCGGGCGTACCGAATGCAT 

AAATTTCAGTGCAAATGTCGCTCCATGTTTCAGCTGGCTTCCTTTGTGGCTCC 

CCGCAAGACTCTGTAACGGAAGTGGTGGCTATTATACGAACGAATATCTGGC 

GCCTTCAATTCGGCAGTGCGCATATTGCAAGTGGACGGTGGACATATCCATA 

TGTACAAATTAATACTTATCGGACATCAGCGTGAACACTGCGAATTATTCTAG 

AAACATTTGTAGAATTCGAAAGATTTAAGGAAAGCAGATGCTGAATATTAGG 

CGAAAAGCGATTGAACTACTCTATAATATGCAGTCAAAAATATCATCGATTC 

GCCTGTCAATTAATTGTATCTAAAATTATACTTTTCGAATGTCTATTTTGGCAA 

TAATCTTTAGTGATTCGTACTGCTCAGCATTTAATTGAGTGTCGCAAGCAATT 

GGGGCCGGGGTATTTGCAATGTTTTTCCAATTCTCTGCACCGAAATAACCACA 

AAAAAGACAGCCAGTCAGCCAAGATATTTTGGGTCTCCTCCGAATGGAGGAT 

GCACATCCACGATGTGCGATGTGAATGCGCTGCAATTGGGCGTTCAAACACA 

TGTTGGATGGTCCAAACACAAACCGCATTGCCCGGCAAGGGAGCGAGTGAGA 

TGGGGATCCAAAAATGCTAATACACGTCGGCCAGCACAAAATCAAAATAAG 

AAACCCATGCTGCTAAAAATAAAAACTGGCGGCGGCGACACAACGACACAT 

CGGAGCGGTCGGAAAAAGCACACAGGCGAGTGGAGGAGCAAGATATAAGAC 

AGCTTTGGGAGCGTCTTGAATACGCGTATATCTGGCTATTTGTGAATGCGAAG 

GTTTTTGAGAAATTCAGAGAAGCGCACAGACTGTTCGAATACGTCTATCCTAT 
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ACATCAGAATGGTCAGGCACTTTCAACACATTGGCCCCATCCATCCCACTCAA 

TATTTACATGATGACGATGATCTTTTGGTCAATGTTTGTGTTGGTCGGGTATT 

ACAGAAACCGATATCGCGAGTTATCTATGCCATATACACGATCCAATGGGGG 

GACGGCGGGAGGGGCAACAGTCATGCTCGCATATATTTGTGCTATTTTTGAA 

CTATTTCGGTACTGCGAAATCTATGTGATCTACAAAAACCATGAGATGTCTGA 

GATATGACTGCTGAGTGCCGGAAATTGTAGGATTCTCGATTCCCGATCATATA 

ATGCATTCTCGAACAGAAAATCTCCATTACGAAATGCTTTCTATTCTTAGGCG 

TCGCACAACTTTAATTGGAGCTTCCAATGTTGTGTGAATAAGTGTGTATATAT 

CCGTGGTCTATATATGCAACGGATTTTGGTGAGTTTTACCGTCTGTGTCGGAA 

CTGAGTGTGCCGAAATCTTTCCGAACTAGAAGACCGCACCGTCAACGCACGG 

CATAGTTCACGCGTGTACTGGCCGCTTAGGATGCCGATGCCGATTCCGATTGC 

GATCCGAAGATACACCACCCGATCTGGCGCCCGATCTTTGGCGAAGCGAGCT 

ACGTGTTAAGTTCTCGGCGTGATGTACTATAACAATGAGAAACAGTTTACTTA 

TCTGGCTTACACTTCAATAGGAAAACAATACTTTTATATAGCTTCTATAACTT 

CGGGGTGCGATAAGAACATGAATACAGATACACGGATTGCAACAGTACCCA 

AGCCACTTGTTTTAAACAAATAACAGGATAATGGGGAGTAATGTAAGCTATT 

GACTGGGTTACAATCAGGGGTCTGATAACAATCAAACATTGTCCAGTTGCCTT 

TTGCGAATATCAATGACCACTCACGAGTTGCAACTGATAACGATTATCGCCG 

CACAATGCAGTGGGTGGGTATTTCACTGGGGGGAACTTTTGGGTCCCTAGAA 

CCCAGACGGATTACTCAATGAATATAGGCGATATGTTTGGGTTTACAGCGAA 

AGTGCTATTAATGTCGACCGTATGCTCTCTTCGATGTGCCAGCTCTCTATTTGC 

GGGAATGAATGACTATTTTTATGGGTCTGCCGTCGCTGCTACAATGCTGCATT 

GCTGCAGTGGGACATCCTTTGAACAGGCGCCATGCCAAAGGATATTCTTTGT 

GGAAGGGGGGGGGGGGGCAAGGGTTAAGGGTCACATTCGTTTGCGCAATAC 

TTCCAGCGATGGGGCGGTGAACGGTGGGCGGGGCGATCGGTCAAGGCTTCGA 

CTGTGGAACGTGACACGCATATGTCGGCCGGAGTTTGGCCCAAAAAGTGGCC 

CCAATGGTTGTCCTTCGCGCTGGCAATTAGTCCCTAGCAAGGCGCGTCCATAT 

TTTGCAAAAATTCGTGGGGCGCCTTGTTTTCTTCTCTCTGTATGTGTGCATGTG 

TGTGTACCTCCGTCTCACTCACCTCAAGTGTGTGTGTGTGTATGAAAATACTG 

CGGTATACGGCTGCGTTTGTGTGTGAGTGTGGGTTTCGGCTCTACTCTCCCGA 

TGATCCTGCTCCTCCGGTCCTAATCCCGGCCTGCTCGGCTGCTCCTGCGTCCT 

GACTGCGCTAGAAATTCGCTTAAAACGAGCCTCGACGGGTCATTTTTACAATT 

GTTTTTTGTTGTTCCGTTCGGCTGTTTTACCAGACGTGCTCGTTCCGGTGTGAC 

TGCCCGCCGCTGACTGTAAAATACTAAACGCATTGCAGCTGTGGCAATGCCC 

AAGTCTTGGTCTTACGGTCACACTGGCAAAGTTTAAAAATTTATTTATTTCAA 

CTTTCAGTTACTTTTCGTTGGCTTGAATATTACACTAAGAATTCAATTTGACAC 

TTGCAATTTATACATTGTATATTATAATATATTATATGTATTATATTTTATATC 

ATATAAAGATATTTATATCTATTGATCTTTTGATTATAAGCTCTTTGGTTGAAC 

AATATAAGTGCAACTTTCTCCATCACCTTCCTATCTTTTTACAATATGCTTACC 

TCGTCAATACGTTTTTTCTATTTCAAATATTTCAATATTTCAAAGAAATATTTT 

GTTTATTTTTCTGTGTGTTTTTAAGCAATCTGACCCCTGTAGAAGAATCCCTTA 

TAATATTAACAAATGTATCCTCAAAATAGATCGATCTCTATCTTCGCAGACTT 

ACACGAAACATTCCAGAACCGATAGTTTTATGCGATATATGAGATTTAAGGA 

GTACTTTCCGCATTTCGCCATCACAGTCACGCTTTCCTTGGCATTTGCAATCA 

AATAAGCGCTAATAATAATCGTAAAAGCATAAGAAGCATATAAAGAAGAGT 

CACCGCCAAAAGCATGCACAAATATATATAAATGGGGAGCGATTTAAAAACA 
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GTGCACTGTGTTTAAAACATCGACAGCTATCGGTTAGCATATCGATATTGACA 
TTCGCAGTCAAACGTTTTCGAGATACAACCCTAAAATCCGAGAAGCATCCAG 
AAATTTCGACGTAGACGAGGGCGAACCTATAAAATGAGGTTGACGCACGAAT 
CCCGCTCATTATTAAACAAAATTTTGAAGAGAAAGAAACTCTGAAGtAGGGT 

GTGTTTTTAGTGCGCAAGCCACATTTGGTGGATAAAG 
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dTPR2 5 ' region, 13015 base pairs 

AGACAAAGACAGCGCTGACTTCAGTCGACTTTCGTATTCATTGTTAAATGACA 

TGCAAATGTACGAATGACATGGCATTCGCCAAAGGGTTTTGAAAGGGGGGCC 

AGATCCAAAGGGCAGGTCTCAGGGAAATGTTTCCAGGCTAATTGTGGGTTTT 

ACGCCCTGTACTTCTCCAAATGATCAAGTACGTCATTTAATGGAAGCCACTGA 

CAATTGGAATCGTAAATTATACAGCACAAACTAGATTTGTTTGAGTGCTCTCA 

ATGTAGGCTAATATTAGATTTCTGCGCTGAATTAAAATTATTGTAATACGTAT 

TATAATGCATTTGTACCCAAATTTGACAGACTTAAGCAGTTCTCTAACATAAT 

TGGCATCATTGGCAAAGAGAAATAATATTAAATTGGCAGCATTGCCAGAAAA 

AACTCTTCTCCTAAATTTTGCTTGATTGAATGTTGTAGTTGAGAATGTTGTAA 

AATAGTGTTAGTATTGTAACACACGACATTTTTCAAATATTTAAATGAAAATC 

ACATGGTAATTAGCAATTTTGGGTGGCCTTCTTTCCTCCCCAAGCCAAAGCCA 

TATAATTTCAGCCAGCTACTTGCGATTTCCCCCATGACCAACAACAACAGCCC 

CATATGTGCAGTGCATTAATGCAGATTTCTTGGCAATTGTTTTTGCATACTTTG 

TTTTTTCCTCACTCACTTCAATTTCAATTGGCGTGCTAATAACTCATTTAGTTC 

GCAACAAAAAAACAAAAAACGAACAGCGGGCCACAAAAAATGTAGCTACAA 

ACATGGCACACCAACAATGGATTGGATGGCTAACCAAGATCGCCCCCACTTC 

CCTTTCCATCAATTGCGAATATATCGCATCTCATGATGCTGAGAGAATACTCG 

TACTCAACTATGCCGACTTTATATGAACACTGTGTGCAGTTTTGTTTTAGGCTT 

TGTAATTATTATAAAAATAAATTGAACTATTGTTGCCTCATTTAGATTGAACA 

GTGAGGCAGCCACAATGTTGCTTTTGTTATTCGGATACACTCAATTAAGCTGA 

ATTTGCAAAATGCAAATGGCCCGTATGAAACTCACACCTCGAAAATCATAGA 

CTCGAATTATTTTAGAAATTTAATATAATTATATTTTGTTTTCTTCTTTTTTTTT 

GGTTTGGTTTTTTTTTTTTTTTTGTTTTTTGTTTCCTTGCAACACTTTTCCGCCTC 

TCATTTTGACAGCCCGAGGAGTTCGGTTGGTTCAGTTGATCTCTTGATTGTCA 

GTCAGTCATTTGTGATTAGACATTCGAGAGTCGCCGCTATTGTTGGATGGCAT 

AAATTATAGTCTGTCTCAACAACAAAGCGCTGCATATGAAATCCACATAATA 

AATCAATGTGCTGTCGTAATTTGTGTTAAGTTATTTGTAATCAATTTGAATTCT 

CGCCGTACCTCCCCACCCCCCTCGGTTGGTGAGATTTATGGGAATATTTTATT 

CATTTTGCTATTTTGGTTAAATGGCTTTTTGGGGTTTTCCCGAATATAAGTTTA 

AAATTAACGCGGCAATAGGCTTAAGATCATGTAATATTATATATTGCCCGTA 

AACAAATGCTTTCTACTTTCATTATCATGAGTGTTTTAAAACTCCACGACTGC 

TCTAAACTTTAATCTTTAAATATTTTTGTACCCTTTGAAGAACTAACCACTTAG 

CAAATCCCTCCTATTATTTCCTCAAACTCTTGCACTTATCGAACTCGCTTCCTT 

TCCCCGCCATCTTCACTCGAACAAATTTAACAACAAATTAAACTGAAATGCA 

GTCAAATCAATCGCTGACTTTTCAATTCGTTTTTCCTTCTTTTTCGGCCCAACA 

TTTTCCACTTGGCCCGAGCGTTTTGCATAGTCCATGGCTTCGATTGGATCGGC 

TCGGATCGGTTGGTAAGTCTTCGGCGGAGTATGGCTTTAGTCCAATTTAGTGG 

AAAGGTGTGCCCACCAGCTCGGTCACAACACGTTGCTGTGGCTCATTGGAGT 

TTCGCCTTTGCCTCGCTGGCTTTTGAGCCGTTTGGTCGGTGCCGCTTAAACGC 

CGTTTTAGCCAAGTTAGGTGAAAAATGCCAAGGGAGTGAGGAGTGGAGACC 

GAACTGTCAACTGTGATCAAAATCAATTGTTTGCCATTTGCCAAACCAAATTG 

ACTGAGCCAAGTCAGTGCGAGTCACACAAAAATGCTGACAAAATTATACCAT 

AACCCATGAAATGTCAGTGTCAATAATTTTTGTAATTATGAGAGCATTGAGCT 

TGAGTACATAAAAAAAAAGTTATATATTTAAAAAAATCATTATTTTAGTTGGC 
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TGCCATTGGAGAAGCCCCCAAAAAAGGCAAACAAATATAATAAAAAATTATT 

GCAACGTAAGTTTTGATTTGAACAAAAGGCGTATACAATTGGATGAGCTCAA 

GAGTGTTTTAGAGTGAAAATGTGAGGATCATTGTTCGCAACCAACTAACAGA 

GGTTCGTCTCTAACATTTTTCAAAAAAATTACATAACTTTTAAATTTGATTTCA 

GTTTATTTGTAAGTGAGAAGCCTATTTTCTAACCATAAATTCTGCACGTTAAG 

AGTATTTCCTTTCATATCGTATCTACAAAAATCAATCCAACACACCTGTTTCA 

TCTACCGTTAACACCGTTAAGCCCCGCCCCATTTTCTTATCGAAAATATAGCC 

CTTTTTCACGCTCTATTTATAGCATTCACATTCTTTCTTTTTTTTTGCACTTTTT 

AGCTGGCATATCCTTTCGACTTCCGCCATTCGAGGCTCGCCCAATTTCCGTTT 

CGAGTTTAATTAATTTAATAAACAAATTCTTTTCGCTCTAAAAACTCTCAAGT 

GTATCGATACGATGCGTTTCTTTTTTTCCTTCGTTAAATAAATAATAACCAAA 

AAAAAAAAAAACCAAAAAGTAGGAGGAGAAAAGTTATTGCCATAGTTTTTTT 

ATTATACTTGTGTGTTTACCTTTCTGGTGGCTTGATCGATAGGCATCTGCAATT 

AAAAAGAGAAGAAGAAGAGACAAGTGAGGCAAAATTGTTAAACGTTTTGTG 

TAAGCTTTAATACGAAAAACAAGTACTGCAACATAACGGAAGGAAACACGG 

CTTAAATTCGGGGCACAAATGCTGAAAGGGAAGTTTTTCATTGACGGGTTCG 

TTCTGACGGACTTGCATTTTGGCGGGCAAGCGGGTGTGAAAATGCACACGCC 

CCGAGAACCCCCCTTTCCACCCCCCCTGGACCCCTTTATCCAGCCCACTGGCC 

AAAAACAATTTGTAATTATCCACAGAGAGCGCTGCCTTCAGCGGTTTCGCATT 

TCCCCTTTCGCTCGCTCTCCCAACTTGTTTCAATTTAGCGCAAAACTTTTTCAA 

CCTAATAATAGGTTTAACCGCATTTTTAACCGTTCCTCATGTTCGGTCCGGTTC 

GGTTTTCAAAACCGGGAATCGTACTTAGACTGGGTCTCCTTATTTCTGTTCTG 

GCTCTCTGTACAACTTTTCATTGAGAAAAATGTAACTAGTTTTTCATAGCAAC 

GGAATACAATTTAATCCAATAATCCAATAGTTTAATCCAATACAAATGATATT 

ACTACCATTTCTATTTTCGTTAATTTCGATTTGACTTATTTGGCTGGATTTACT 

TTTCAAAATATATGTTATCAATAAGACACAAACCTTACTTTTCTAGCTATTAA 

CATAGTTTAAAAAAAAAAAAAAACTAATAAAAATTACGTGAATCTAAATTTT 

TAAACCCGATATCCAAGAAGATCTCAATTTTTGCCTGTGTACTCAGTTCTCTG 

AACAAAGCGCATGTGCACTTTGGAGCACACTCCATACATGTGGCTCAGCCCT 

TTTCCATAATTAACTAGATGGTTTTCCATCGACTTCATTGTGGTCAGCGGCCA 

GTTCAACCCGTTCTTCACTGCAACCGAGAACTGTAAACACAAAAACCCCAGG 

ACTCTACATTGGCTTAAAAAAATAAGAACAGAACAGAACCAAAACCAAAAA 

AAGAAGGGATATTGAAATACAAGGTTGTAATCGTTTCGACTGTTGATGTCTC 

AATGCATGGGCAGTTCAGTTAGTAAATGTTTTTCAAAATCTTTCAGGCAGGAG 

ATGTTTAAATATCCATGAAATATTTGGATCTCCTGGGGATCAATCGGAATATT 

AGCCTTTAATTGTGTTGATCTTTTAAGCCTTTTTGTATCTAATCTAAGCCATTC 

GATCTAATCACAATTTATAAATATCTGCATATTTGTGTATAAGTCTGCATCAT 

TTGACGTAACTCTTTAAGTCTTTTGGCTTAAGTTGCAACTATAAGGAAGTATT 

TATTTTAGAGACACAAATATTTCAGTCGCCTTCATTTGAACAAATCGGCGAAA 

ATTGGCTAGCTCGCCAAACTTTCTGTAACCAAGGACAATGGTTTTATTTTAAA 

CCATTAAAAACTTTAGACCCACTAGCTCCTAGATCCCCCTCAAAAGATTTAAA 

AAAAAAAAACACGATACCCATTTCTACTGAACTTCGTTTTTGCTTGTCGTTTT 

TTCCACTCGAACGGAAATGAGCTGACAGCGCACCGCACACGTCGATTGCAGA 

AAAACATCGGATAAAACAGGAGGAAAAGTTGTGCAAGGTGGAAAACTGTTT 

TACCAACTATTGTTAGAGGCGTTCAAAAGAATTACGCAGCTTTCGGTTAGTTA 

GCAAGGGGTCACCGGGGAGCGTTACGTTTGCATTGCGTATTTCCGCTAAATGT 
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CATCGGAAAAGGCAAACGGCGAAATGCGAAACGAAAGTTTTTTGATTGCCCG 

TGTTAATCGATATCGATGCACAAACTATTTGCATTGCAACCGTTGCAAGAATA 

TGCAAGAAGTTGGGGGCGGCCGCGGCAGGGGGTGGAAGTTGAGTGCGTAAG 

TTGGCTAAAGCGGAAACAGGAAATGAGAAAATTTTGCAGAGCAAACCCCGA 

ACTGGAAATGCAACTAACTGGGCACATGCACTTTGCGAAATCATTGGATAGC . 

GTTAAGAAATTTATTTTAAAATTGTAACTAACATTTAATCGTATTCAAAAGCA 

ATTAAATCCCAATCCAATTCTTATATAAAATCCTTACAAGATTATTCTATTTA 

CTGTAAATCTAAGCAAAAACTCCCTTTGCAAAATATTCGCCTGCACAGCACA 

GATCAGTGAAATAATCAAATGAAGTCTTGAAATAACGAAAAACCCCCAATTG 

CGTGTGGAACTGCCCCCAATGCTTTTGCTTCGGTTTCGTACCTGGCCGTGGTG 

CAGTCCCTGTAGAGGATGTCGAAGTCCTTGCAGCAGAGCAACTTGCAGCGAT 

TGACGCCCGGCTGGATGGCGCCCAAGCCGCGTAATATGCGCACCTGTTCCAC 

ATTGCAGACGAGCGGTACGATGTCAAGGCGCTTTAGTTTGGTATAAACCGTA 

TGCAGACCGCCGACCAAGTGCTTCAGGAAGAGCTCGAAGGCCTGCGGCAGGC 

AGAGCATCGTTTCGTTGCTAATTATGAATGCGGCGACCTTCTGACCCCGGTAC 

TCCACCAGCTTGCACTCATTGGCACTGGGATCCGAGGTGGAGATCGGCGGCG 

GTGAGTTGTACGACCTTGGCGGCACATGGTGATGGGCGGCGGCCATCAGTTC 

CAAGGGGGAGGCATGGTGCATCATCTGCAGGGAGTTGAGGAGCCCCAGCGA 

ATGGGGCGGCAGTCCATGGGGCATTCTGGGCGGTAGGCCCGTCGGCAGTCCG 

TTGCCCGATGGCATTCCATGTGGCGGGGGGCTGAGCTGATGGTGTTGCTGCTG 

CTGCTGTTGCTGCTGCTGTTGTTGCTGCTGTTGCTGCATCTGCTGCATCATGGA 

GTGGTTGAGGGAGCTCACCGGACTGACGGCACTGGGATGGCGACTTGGTGAA 

CTGGCCGGTGAGCAACTGGATCCTCGTCCTGTGTGCGAGCTCCGTCCATTTGG 

ACGATCCTCGTTGGCTCCACGCGAACTGTTGCTACCGTCGCGTCCATTGTGCT 

GCTGCTGTTGCTGCTGGGCTGCTGCGGCTGCCGCTGCTGCGGCTGCCTGCTGC 

TGCTGCTGGTGATGCATCAACATGGCGGTGGTGTTCATATTGCCCGCGGCTCT 

TTCAATACCACTATTAATATTGTTATTTATTGCGCCCGCTTTATTGTTGTTATT 

TTCACTCTGTTCACTTGTCACAGAATCCATACTTCATCATGGCCGACACTTTTG 

TTTATTTACTTTTTAATCGATTCGTTAATTTGACGTTTTTTCTATCGTGACAAA 

AATTTGACACAAAGTAAGGGAGAAATAGAAAATAGATGGTGAGAGGAAGAT 

AAATAATTAATGAACTCTTAATTCATTTTTAATTATTTATTAGGCTTCTATATG 

CAAATTCTAAGTGAGCGTGTCTCGTATATTCCTATCCGCTTATTATTGGCTTTA 

CATTTTTAATACTTCTGTAAGTTTTATAACATCAAATTTAAATGCAGACCTTC 

AAAAAATTTACAAACGATTTAGGATTTGTATTAGGCTGAGCTATGCTCCTATT 

TATTAAAATCTATTTTTGAGCCAGTTTAGTTAGTTATATGGTAGCTACAAGTT 

TATATTGCTAAATATTTTTTGTAAATTAATATCCTAACAAACATTTTACTTACA 

AAGAAATATAGAGAACTAACAGAAAATAGAAAAGTTTCCTTTCAGACATTTA 

AAGTCCGATTATCTTCTAATACCCCCCATAAATAATCCTTTATCAACAGAACT 

ATTGCTTTGCAAACTTTGCTTTAATTAAGTTTTGGGAAAAACAAGGCAATGAA 

GCTAATTTGGATCCTTACTGCCAATTTGCATAAATATACCTATTGTCAGCTTT 

ATTTGAATAATTCGATATAGAACATAGATTTACCTTTAAGGAGGTCTAAAAGT 

AATTTATAAACTCAACATCACTGACACAAGACACTCGCGCACTTTGCTTTTTG 

AATTTCGCTGTGAAATATATACTCTGAATATTTCAAGTTATTTATTCCGATTGC 

CCGCTTGTGTTAATCGAGTTCGAATAACCGTTTTCGTACTGGAATTTTGGAAA 

CCGGAGCTGTGTCCGTTTTCGAGTACCGTACCGACGGATTGTCACTCAGAGAT 

TGAGAGATGGCAGCTACTCCGCTGCGACGGCGACGGTGGCGTCGCTGCCTCT 
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GCTTCTTCGCCTTCGACTGGTTCCTCTTCCCCCTCTCGTTCGGAGAAATCAAC 

GAAACGAATTGCATTCGAATGGGAATCGACTGAGAGCGAGACGGCGCGAGG 

CGACGACTGCGAGTGAGCGAGTGAGCGGGCGCTAACGAGTGCTATTTTTTTA 

GCCCACCCACACACACGTACGTACGTACGTACACACGAAGCGCTACCGTTAT 

GTACTGAGAGAAATGCGCGCGCAAAAGTTTTATTGCATTACCTTCTCTTGCGA 

ATGACAAATTCGTAATGAAAGGCGAGTTTCAATTCGATTCCTTTCGGATTTTC 

GTGGCAGCGACGCCGGCAGCGCGGTCGGCCGAGACGAGTGTGCTTGTATGTG 

TGTGTCTGTGCCTGTGAGAGCGAGCTGGTGTATCTGTATCTGCGATTGTGCAA 

AACCAGAATACGAATACGAGTACGAATACGAATGTCTGTTGCCCGTCCACGT 

CTCGCATTACACCAATACCAGGCCAAAAAGGGGAGTGGTATGTGCGATTGAT 

CGGTGTGTTTGCATCTGTGTATATTTCTGTGTGCAACCCCGAAAATACAATGA 

CAAACGTAATCGCTCTCTCTCTCTTGCATTCGTTTTATTTATTTTTTCAATTCGT 

TTGTGCGTGTTGTGTTCTTCAAATCCTCCCGCTCTCTCTCTTTGGAAAAAAAA 

ACGTTTTTCATTTCAATTTCATTTCGTTTCAGTCTGAGCCCTCTCTCTCACACC 

ATCTCGCCATCTCTGTCGCACGCTCAGGTGGGCTGCAACCAATAAACACGAG 

CGAGCGAGAAAGCAGCATATTTGCATAGCCAGTCGTACATGTTTGCGCTCTC 

GCTCGCCCCCATGGGCGACGCCTTATATAAACAAATGACAATTGTTTTGGCAT 

TTTGTGTTGCAAAGTAAATTATAATAAATGCATTGCCAGAGAAGAAAAGTAA 

AAAAAAATAGCTTTACTTCGAGTTTGCGCAGCTGTCTTTGACAAAAAGCATTT 

TAATTTCAATTAAAAGTAAATGACAAACTTTCAACGAATTATACTTTTCGGGG 

CAGTGTTGCTATCTCTTTCCGTCCCAAGCTTTGATTTTTTTTGTCAACCGTTTT 

CCGTTTCCCATTCGTTTCCATTCGAGTCCCGTTTATTTGTATTTCTTTTTGTGTT 

GCTGATTCGGAGGAGAGCAGCACTATGGCAGGGCATCTTTCTTCCACTTACA 

CATATTGCGATAATGGGGTTTTTTTTCGCCTGAGGGGCGTTCGTTTTTCGGGTT 

CTTATAAATAGCATTGCTTATAAATTCTGGCATCGCACCTTTGCCACCTCTAT 

ATGTTTATGTACAATGTATCTGAGAGCTCGGTCATTTTTCTATTATTTGTCTTC 

GTTTCGCCTTCTGCGATTCTTCTCCATAACGATTGCCATTCCGTCGCCGAACC 

AATCGCATTCCGTTCTCTCCATTTGAGAGTTCCATGTACATATTTCTTTCTATA 

TGGGAATGGAATACGTCTTTATGTATTGTGTTTGCACATGACGTATGAATTTT 

TCTTGTTCGTTTCGTTTGGGGCTTTTCTTTTGTGGATTTCCTCACCCACTGTCTT 

TTAGGTGACAGCAACCATTTAATATTAAATTGATTGCAAATGTGGATTTCCAA 

CAGCTTTTAGAAAATATTTTCGGGCTTTAAAGAAGAATTTAAAACACAATAA 

TTATTGTAATGTAAATATTTTATTTTTACATCGGTTTGTTTCATTAAAAAATAG 

TTATAAGATTATATTAGATATGAAAATTAATATGTAACGCTACTTTTTTTCTA 

AACTGTGACATTTTAGGCTATTTTTTCTTTTACCATTTCCTTATGTCATATGAA 

TTTCATTTAATTATGACATATACATGAATCGCTGGCTTTAAATTCGAATAAGT 

ACATTAAATTTACCAAAAATGACATGCAGAATTAAAAAGTATTCATTCAAAC 

AAATTTGTTTTCCCCCCATAAATGGACAACAAAAAGGTACTGCCTCTATCATC 

CAAGTGTCAAAATATGTCATAGCAACCAACTATCGTCAGTAAGAAATGAGTT 

CTACAACATGCAACTTTTTCATGGTGTCGCAACTGTGGGCGGGAAGTTTGATT 

TTTCGCAACAAACAGCTCGCTTTGAACTCTGGTTTTTCTCTTTAATAAATGCA 

ACTGATCTAAGTATTAAGTAAAATTGTATTTTTTATTAACCACAAGCAAGCGC 

AAAGATGAGTTTATATTCTAAAAAAAAGGAGGGTGATTAATTTCTATTAGTTT 

GGATTACAAATTTGGACTAGGAGTCAATTTGAAAGTCGTTATATCAATAATA 

CTTCTGGACTTTGAAGCGACAGTTACTGTTCCATAACTTCGGATTATCAGCTT 

TGCCTTCACCACATATATAGAGTATTCTCTGGATGTGTCGAGATTTGTATTTTT 
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AAACGACGACTGGATGGCAAAAGTTCAGTGCGCTCGCAGCTATTATGTGGAT 

TATCTGGCTCTTGCTGGTGCCCCTTGTGGCGGCCAGTTCCAATACAAGACTTC 

TAAATGGCATCCTAAGTCATGTGGACAAGGAAGCCAATCCCTGTGAGAACTA 

CTACAACCACGCCTGCGGCCAGTACAACATGCGTCACATCGACGACACCTTC 

TTCGACATTATACAAATGCTGGATCACCAGGTTAACCAGAACTTGGTGAAAC 

TAATGGACGAGCTGGAAATGAGTTCTCAATTGCCGGACTTTAATGTATCTAGT 

GTAGATGGCAAGGTCCTTCGTTACTACCTTAGTTGTCGTGGAGCGCCGCGGA 

ATATGGATAGTTTAAGCCAGTATCTGAAAGTGATTTCCCCCGGCGAAGGACT 

CACATGGCCTCAATTCATTCCGGACGGTAGTTCTTGGCCCCAGGAGAATTTCA 

AATGGCTCAAGGCACTGGCTCATCTGCATCGCTACGGTCTAACTAACGTGTTT 

TTTAACCTTGAAGTCGTGTCAAACCCACGAAATGCCAGCGAGTACATGGTAG 

AATTAAATACACCCACTTTTGGAGAAGAATCTCAACTGCCGAACAGTTTTATT 

GAAATTCTATCCGTTCTCTATATCATAAAGGTTCCTTCCAGTGAAATCATTAC 

TCTGGCGCGAAAAATGCGAACGCTTGAATTGTTGCTTAAAACGATGATCAAT 

CCGATCGACACACTGAATAATAGATACATTAGTATCCGCGATTTTCAGATGG 

AAACCGGTCACAACTGGCAGCGTTTCTTTGAGATTTTAATAGGCTCCAGCGCA 

GCCCCAGAACTCCAAGTGTTGGTGCGCAATTTTAGGTACTTTACCGCCCTTAA 

GGAACTAATGGACAAACAGGATGCTCGGCTGGTGGCCAGCTACATAATGACC 

CGATTTGCAATATTTCTATTGGATGAAACCATGGGTGGCAGAGAATCCACGG 

AGTGTGTGTCACAGGTGCGCCGCAACATGAATTTGGCTGCAAACATGCTCTA 

TAAGGAACGATTTTTCGAAGACTCCACTTTCAGTGCCAATATCCTGGAAATTA 

AGGACATTTTCGAGAAACTACGCCATCAGTTTCTGCTGCAAGTCGATCAAAA 

TCATCTAGAGTTGACTGCTTTGCAGATGAAATTTTTCGTTCGAAAGGCAGAGG 

CAATTGAGATCAACGTTGTGAATCTTCCAAAAACCGATGATCTTCGCCATTTC 

ATCGGCCAGTACTACCAAGACTTGCAGTTTCCCACTGGCGAGCTGGATTACC 

ATCAGGAGCACCTCAAGGTGCTGCAGTTTCGCACCCAAAAGATGTTGGCCCA 

ATCCAGCAAAGGGCACTCAGAGGAGCAGAATATTTTGACTTACAGGAGCCAA 

GCGGCGCCATTGCCTCCACCTCGTACTATGTGATGCGCCCCAATGTGATTATT 

GTCCCCCTTGGGCTACTGCAAGAGCCATTCTTTCAGCTGGAAAGCGAAGATG 

TCTTCAAATACAGCCTGATGGGATATATTATGGCACATCACTTGATAAGCGCC 

TTTGCCACCGAGGGCATTACAATTGGCAGCGATGGAAACGATCAATCATTTA 

GATCGCATCGTTTCGAAGAAGCAGTCAGTTGCTTGTCACGCAATTCAGAGAA 

CATCGATGAAAGCATGGGCGATATTGCTGGTTTAGAACTGGCCTATTTTACTT 

ATGCTAAGATGGCCAAGAATCGAAACCGTTTGGATTTCACCCATTTGCCACC 

GGAGCAGATATTCTTCCTAAATGTTGGCCAGTTCTTCTGCGGCAATAGCGATA 

TGTTGGTTCAGTACAAGGAAGATCAAGTGCGTTTACAGCGAGCTATTGAAGG 

gtttgagccatttgacaaggcttttgggtgctaccgcaataagcctaagcac 

gagaagtgtcgtttatagtgaataccttgtacatatgcttagaaatacatatt 

ttttgataacaataatacaagacaatcgtgttaaattataaaagtgttacaat 

cacatccattctgttcttttaaaattagttttaaactaacaatagtcaatagg 

ctaagatagttaaatgatcatcattcgaataaacaacgttcaagattgactct 

tcaatgtcatgcacctgcaagattaccatttattataaattaaaaaaacacac 

aaagttatacgtgtgttacttacatgcattacattcgggcctggccatccact 

taatatactgagatgtagcggtctttgatttgcgggatctcttatggatttta 

gaacattgttaactttgctgacaaagtaaattcaacttttaacgacttgtggt 

gtgtgcggcccgatgaaatgtcttaaaatacaaattaaatacaattcaaata 



m. • 
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TAATTCAGACGTCAAAAGGTTTAAAGTTAAAATATATTTTACCTTTTAGTGTT 

ATTTATACGTATGAGCCTTGAAAACACAGTTGAATATCAAACGGATTTTTGTT 

ACCAACAGATTCCAACAGATTCTCCAACTTTCGTTTTTTGATTGCCTATTCACT 

CGAAGATCTATTTCCAGTACTATGATCCTCCATAGTAGAGTCAGCTCAGGATC 

TTGTGATAATCCGCAAGCAATTCGACAAAGAATTCGTCGGCCAGAACAAAAT 

TTATTAAATCATTGTAGTCATTCTCAGGATCTCTCTTAACTGGCAATCCGTAA 

TAACGTATTTCATTATCTCCAAAATACAGTCGGAATTCAGATTAAATTTGCCG 

TTTCCGTCCTTTTTTATAAATATACATACAAATATACTAAGCAATAGACTGAA 

ATGAATTCTAGAATTTGAGGAAACTAATTATGTACCTTTATGAATACTTTTCC 

TTACTTGTACTAATCAAACTAATTTTTAACAGATTTTTCATGCCGAATGATTA 

CAATCTTATTTGGATGATTTGATAGAGCTTAGGAATAATGGTTTTAATTTTGG 

ATTAAAGAGTTGCGATTAAGAAACGAAGATATTATCTAGTTTTTGAAGAACA 

CAGGGTACTTTAAATTTCGCACGCGGAACGTCAAAACAAGAAGAAGTTTTCA 

TCAACACTGAATTTCCGCTTGGTAATCAGCTGATAAGCGTGCTCACGATAGCC 

GAGTTCACATCCAACAGATGTTTCCCTTAGCAGGGTTTCAGACCCAAATGATG 

ATTTATCTTATTTTGATTAAGCTCCAACACGCATTGCTTTGCATAATTCAGGTA 

TTATTAGGCTGCTTAATATACAATCCACTTATATTGTTGTGTCCATGAGGAAC 

ATCGACACGTGAGGATAAAAATATTTATTTATCGATATATTTTTACTCTTGAG 

CCTTTTGCACACCCCTAGTTGTGTTCCA 
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dMLF 5' region, 8374 base pairs 

GCCTGATTGTTTTCCACTTTGCAGCAGAGGAGCCGGGAAGGAGCGGTAGAGG 

CGCACCCAGTGTATCCGGCAAAGGCAAGTCACCCCAGGTGCGTTCCATGCCC 

AGCTCTCCGCTGCCTCAGCGATCCGCTACGCCGACGCGGCTGATGAGCCAAC 

GTGTCCGTGAGGCGGCCGAGCGTCTTGCCCAACAGCACACGGTGGCCAGTGC 

TCAGCGGCATtTGGGCAATGGGAGAGGCACTGGCACTGGCAATGGAAATGGC 

AATAGCAATAGTAATGGCAATGGTAATGGGAACACCGCGGAGACGAATCGC 

GAATCACGCGCGCGACGTCTCATCAACCGATTCAATAGCGAAACGCAGCATA 

TCACGTCCTAGTTTAAGTCGGTTAAATGCCGACGAGCATAACTTTATTACAGA 

TAAAGCAGATATAGCATTGTTTAAGTAAAAAATATATATATATACCCCAGAG 

AAACTTTACGAAACACTCGAATATGAATGCGACTGCGGATCAGCATCCCACC 

CACCCACACACACACGTCTACCCACTCACAGTAGGATATATGTATGTATGTCT 

GCATTCAAGCGGATGCACTCCCTCCGTTCAGAGGGAACTGTACTTAGGCTAG 

AGGAAGCTAAGTGTTTAAATTATTGTATCGATTTATATACATATTTACCATAC 

TAATTAAAGTTAATGTAACGAAAACGCAGGATCAGTAATCTTATTTAGTTCA 

ATGGTAATCAATGTGCGATTAGCGGATGATCGCGCTCCTTGAGTCGCACCCA 

CAGTCCGCCGGAGGCTCTCAGCGTAATCCGGAAGGTGGCCGCAATGGTTGTC 

TTTCCGGTTACAGGAAGCAGCTGGTAGCTACGCAGCAGGCGCGACACTATGG 

TCTTGATCTCCATGATGGCGAATCGATTGCCAATGCAATATCTCGGTCCAGCG 

CTGAAGGGTAAAAAGGCGTAGGGATGACGGTTCTCGGAGTTCTCGGGCGAAA 

ATCGCTCCGGCTGGAACTTTTCCGGATCGGGATAAATGTGGGCAAGACGATG 

GGTGGCATAGGGGCAAATGAAAACGTTGCTGCCGGCGGGCAATGTGTGCTTT 

GCCAGGCGAACCTCTTCGCCCAGTTTACGAGCAATAAGCGGGACACTGGGAT 

ACAGACGCAGTGCCTCCTTGATGCACATCTCCATGTAACGCATCTCGTGCAGA 

TCCGTCATCGTGGGAGCTCTATTACTGTCCTCGAATATGGTCGCCAGCTCCAG 

GACACAGCGATCCTGGCACTCGGGATTCTGTGTCAGCAGAAAGAGAGTGAAA 

GCCACGGCGGCACCCACCGAATCCTGGCCAGCCAGCATAAAGGTACAGGCCT 

CGTTGACGATATCCTCCTCGGTGAAGTCCCGATTGCTCTCGGAGATCTCGATC 

ATGTGGTCGAGCAGACACTTTCGCTCGCTATTGCCATTATTATTGTTCTGGAT 

TTGGCGACGTCTCTGGATCATTTTGCGTGTGAAGTCATTGAGGCGCTTCTTCT 

GGTTAAGCTCATCGTTGGCCATCTTGGTCCAGTGGTAGATCCCGTCCAGCAGC 

AGCCAGGGTTGCGTAAACCGCGCGGGCATCATGATCTTGCCCTGGCGGAACG 

GCGAGTCCTCCATCATGGCCACATCCTGACCTCTTTTCTTGATCGGCACACCC 

AAAACGGCCTCTGCAAGTCGTTCAGGGATTAAGTGAGAAATTATAGCTTGCT 

AATCCCCTAGAGACTCACCATTTAGTATGTCCAGTACACAGTTGTTCACGTAC 

TTGGCAATATTTATCTCCGTTCCCACGGCTTCGGCATCCAGATTCTCGTACAA 

CGATTGCGAGGCATCCACAAAGGTGTCGATGAACTTCTCCAGCAGATTGTGA 

TGAAACGCTGGCTGGATGAGCCGTCGATGATTGCTCCACTTGGAACCACTGC 

TGGTTATCAGCCCATCACCCAGGAAATTGTGCATCAGTCGGTAGAAGAAGAC 

CTTGTTGGTGTGCTTCTTCGAGGAGAGTATCACCTGCAGATCCTCCGGCTCCA 

GGACAGCAAAGAAGGGAAAGAGCAGCACCCAGATCCGCACCAGAGATCCAT 

ATAGATCGAAGGCCTTGCCGGCACATCTGCGCATCACTGTGAAATGGGATTC 

AATTTAACTTAAAAGGTATCTTTCACGAAAAGGTTTCTTCAAGGATCTTACAG 
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TCCTTATCCGTGACCAGCATGCAGTTGCCCAGAAATGGCAGCGATGGCGGAC 

CCGTGAGTCTCAGCGAGAGGAGAACCGATCTCAAGTACGTGTTCAGGGTGGC 

GTAGAATGTGTAGATGCTCAGGCTGATCACCAGGAGGATCAGAATGGAGCAT 

AGCTCCAAATTGGTGGTGCGCTCCAGCTGTGGGGGCGAAAGCAAACGTAAAT 

GCATTGGGTCAAGTCGCGTGGATAATTGCCCGCTTAGGTCAATATTTGGTTTG 

CTATCGAGAACGCCGAGCTCTTGAACGCACTTCATCAGCTACGCACTGCGCTC 

ACTGGAGTCTAATTAACTGAGGAATCTTGGAGCACTTAGGCATTCGAACTTG 

GATGCGAGCACTTGCCCTTGCCGCGTGTCGCAAGTTTTCGGCAAACACACGTT 

ATCGTAATCGCAACGAAAGTATAAGTTATGTATCTAACTGCGGTGTGAATTG 

CTTGGGCCAACATGGCGTATGGGCGATGCTATAAGTACGTGTGTGTGTATCC 

ATAAGATCGATTAAAGCACCACACCGTTCATGTGTACGTGTTGCTTTGCTTTG 

GTTTTTTTTTCTTTATTTTTGGGCCATTCGCGTCGATGTTTCGTGGTGCAACAG 

GTTACACGATGAGCACAAAACATGACAAATGATGATGATCACCGGACAAAA 

ATCCAGGGACAGCCTTTTGTTGCCCACACTTGCCACACCTGTCGTCGCCCCAC 

ACCCTTGCACCACTAACCCCCCCCCCCCCCCCAAAAAAAAACCCCTTTGTTTG 

GTTCTGGACGAGAGTGAGAGCCCCAACACCATTAGCCAAATGCGATTGGTTT 

CAGGGCCAAGTGAAACCACCGGTTGGTTAACTGACTCAGATCTCAATGATTA 

ATTTATTACGGACAAGGAATCGGCAAACGATCGCAGTTGGTCATCATAAAGT 

TTATCCAAAAATCTAGGTGGCATTCCATTTAGTGGGAACTTCTTACCATCAGT 

TCGTAGTAAGCTAAGTTAAAGAGTAAAATAATAGGCGCTTTTAATCCTCCTCA 

GCCACCTCATCCTCGTAGCCCTCGGGCAGGGCATTCACCTCGGCATTGTGAAA 

CAAACTGCGATTGCCATCTCCCCATGGGAATCGTTTGGTCCGCCGCCTCAGGT 

ACTCGTACTTGGCGAACGGCTCCCGCTCCACGTGCTTGTGTCCGGTGAAGGCG 

TTTGCGGCGCACAGGACGATGGCCGGTAGGGCCAGCAGGAAGGTGACACGC 

TTCCACAGACCAGCGGTATTGGCGGGCATATTGCCCATAAGCCGGATATCAA 

AGATCGGCATGTATCTGCACAATAACTCGACTTATTCTGATGCCTGGCTGGAT 

GGCTAAGCTAGCTTGAATTTGAATACTACGTACTGTAGCGATTTGAATCTGAT 

CCGTAACACCCACGGCTGCTGCCCGAAACTATTGTCGCAATTAGGAACTCTCA 

AGGGGATCCGAGCCAGCGCCACAAGGTCCAACAACCCGCGTATCTTTGTTTA 

ATCAGCCCAATATTTGACCAGAAACCGCTGAAGCGTCCAGAAGGCTTGCGCT 

CCGCTCCAAGCGGCTACTTCTATTTTTTTTCCTCTTCATCGAATATTTCTTTCTT 

AGTTTCGAATGCTTCTTTTTTTTTTTGGGCACGGCATATCCATCCCCATCCAGG 

CTGCGAGGTGTGCAGAACCCGCGCCGTGTTTGCTCGCCAATTGGCACCTGGC 

CACTAATAGATATACATCATGATTATTTCCCACTAATTCCATAAGTTATCATA 

ATGGTCTTCCTAAACGAGAGGCTGCTTGTCGAGGCACTAAGACCGCCCAAAA 

TCTAACGATCCATTGAGATTGCGGTTAAAAATGATTCAAATGCAAGCGAAGT 

TACTAAAATTTGTGAGAGTATATCTAGTTGAAAACTTGAACTTGAAAATGTG 

GTTTTCATAAAATTATCCAAATTGATGGGTGTGAATTAAAATTAAATTAAAAA 

CTTGCACTTGAATACTCAAAATTCATTGCTCAATTCATTGATAGATAAATGAT 

GATTAAATAATAAGTTATAAACCTAGATCATTTCACTTTAGTATTGGTAATGA 

AATTTAGGTTTATATATCCTCACTCTTCTTAAAGTAATGTAAATATTTGTTATC 

CTTTAGGAAATACACCTTATTAAAATAATTATTTTAAATTCTATTAAAATTCTT 

TTAAAAAACAGAAACGTAATAGCCACCATTTTACATTTTACTTAAACGTTTTT 

CCTTTTCTTTTTTAAACTTTAGCTGTGAGTAATCCTTTTTATTCATAACGAATT 

GCGTTTAAATATTTTTATATTTTCTTCACTCACCACTTTTTCCACAAACATTTT 

AGTCACGTATTTGTATTCCCTTGATATAGTCAATATATTTTGTTTTTATCTTTA 
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ATAGCTTCACACAAAAGTCCTTGCCACAAGCACTGTCCAAATCCACACATAC 

ACCAAGTTAGTTAGCTCCACTTCGATTTGGGATATATCCGTATTGTGATCTTA 

TTGGCCAGAGTCACATCCGGCGACTGATGAGCTACGAGTGCGGGACCTCCGC 

CGGTTAGCGTCTATTTATAACCGATTTGGCCCGATCAAGCTCGGCTTGAACGC 

CGCCGAAAATGATGTACGTGCTAGCTAAGTCGTTGGAGTCCCCGGATACCCG 

AATCCCCGTATCACCGAATCACCAAATCGCCGAGTCGCCGCGTATCCGCTAG 

ATGCCCGAGTGTATCGAGTATAGGTAGTAATTGCCAACTAGTCGGCACTCGA 

AGTGCTAAGTAGCTAGAAGTGGATATGGTGCTGGATGCTGGATGCCCCTGCC 

AAGTGGCATGGCAATCAATTATCCGTTTGGTGCTTGTTTGATGTATCCCTCCT 

CCGCCACCGCCCACGCTATCCACCTCCTCCATGGAATGGCAGACCCTTTGGGT 

TGCCAGTGGCACTCAACGATCTGAGCGGTGGGAACGAGGGGGAAGTCAGCT 

AGAAATCTTCAGACGCGTGCCAGTGGATCGAACTTTGAGCGGATATTCAAAT 

GGCGCAGACGGACCTCTTCACGTTGTTGATTACATCAATCATTTATGTTTACA 

GTGTAGTCGTCGGATCTTTGCACATTCACATACTGTTGCTTTTAACGTCATCAT 

AAATTCTACAAAATATATTCGGGATTTATTTCCGCGAAATTTTAACCTTTGCT 

AATCTTCAATTGTTTACTGAACACAAACGAATTACGTTTCTTATTCATTCATTC 

ATCATTCAGTATTCCTAGATGTGTTTAGTCAGTTAAGATCGTTTGAGTTATAG 

GTTTAGAAATCTTGGAAATTCAATAGCGCATTGGTTACTGATTAAGAGTTATT 

ATCAGTAAGAATATTATTAGTAATTATTATTATGCCAATCAGACCGATTAGAC 

TACCACTTCTTGTACTTTTGCTGCGAGTTCTCGTGCACCACCGATTAATATGGT 

AAATAAATCTCAGCCTGCTTTTCCAACACCACTTATCTGAAGACACGATTCCA 

TGGAGCACATGGAGATTGAGATTACAGCCATCGACTAGACGCCTTCGTCATT 

CGGGACGATTAAAGTTCAGTGGCAAATGAAATAGAGTCGATCGATGACTCAA 

CGGATCGCTTTGCTGATAATCCCCATTTGTGTTCTCCTTAGCTGGCCAGTTGA 

CTTTTTGGTCAGTTGACTTTCTGGCCAGTTGGCTTTCTGGCCATTTGGGTCTTT 

AGAAGACCACCGCCCAGTTTATAATGGATATAAAACGAATTGAGCTGCAAGT 

CGTATAAACTTTACGATATCATAGCAGAAGTTTATGAAAATCCAAAATACCA 

ATCATGGATGATCGCTAAATTCGCCATTTTGGTGATAAGTGATAAGCTGGCTA 

CTCCAGCCCTATATAAGAGACCTAAATCGAACCACACTTTAAGTTTAACCATG 

TCGCTACGTTTGGGTTTGTTTCTTTTGGCTGCACTTGGTGTGGTAATTCTCACG 

GATTCCGCCTCCATAAGCACCCACATTGTTGGTGGCGATCAGGCGGACATCG 

CTGACTTTCCGTACCAGGTGTCCGTTCGCCTGGAGACCTACATGCTGCTCCAC 

ATCTGCGGTGGTAGCATCTATGCACCACGGGTCGTCATCACCGCCGCCCACTG 

CATCAAGGGACGCTATGCCTCGTACATCCGGATCGTGGCTGGTCAGAACTCG 

ATTGCCGATCTGGAGGAGCAGGGTGTTAAGGTCAGCAAACTGATCCCCCATG 

CCGGCTACAATAAGAAAACGTATGTGAATGATATCGGTTTGATCATCACTCG 

CGAGCCATTGGAGTACTCAGCCCTGGTGCAACCCATTGCTGTGGCCCTGGAG 

GCACCGCCGTCGGGTGCCCAGGCCGTTGTAAGTGGTTGGGGCAAGCGGGCTG 

AAGATGATGAAGCTCTGCCCGCCATGCTGCGCGCCGTTGAGCTGCAGATCAT 

CGAGAAGAGCACCTGCGGTGCCCAGTATCTGACCAAGGACTACACGGTGACC 

GATGAGATGCTCTGCGCCGGCTATCTGGAGGGCGGCAAGGACACCTGCAACG 

GCGATTCCGGTGGACCCTTGGCCGTGGACGGAGTCCTGGTGGGTGTGGTGTC 

CTGGGGCGTGGGTTGCGGCAGGGAAGGATTCCCGGGTGTCTACACCAGCGTC 

AATTCCCATATCGATTGGATCGAAGAGCAGGCGGAGGCGTATCTCTAAAAAT 

GTGGATAGCTTCACAAGCACAACGCGAACAAATAAATCGAACAAATTATTAT 

TTTACCACAATAATAAATATGAAATGAGCATTTAGAAAACATGGTTTATAAT 
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ATATTTACAAATTAATATACGGTGTTTAACTCTTCATTTCAACTGGTTTTCCTA 

ATCAAAAACCTTTTTTATCTGACCATTACATTGGAATCTATAAGCCATTCTCG 

ACGATTTATATAAAAATAAAATTATTACCCAATTGGCATAGGTGAAGGCAAT 

TTATCTTGAGGAAGGGAAAAAGTACAATGTAACTAACCATAAATTTTATACT 

TTACAAAATCGTTTGATTGCATCATTTTAGAATAACTCAATGCAGAAATTAAA 

ATTATAAAATATGTAAATGTGGCTTGAAGTATCATTATTATTTATTTGTGACA 

TTTATATTTGACTTGATGCAATCAAATAATATCCACAATATTAGAAATTTACC 

GTTTGCAGATAGTTTAACGTATTCGAGTAAGATTACATTTGTTTAAATCTTAA 

AAATTTAAAATAATTAGGAAGATTTTGTTTTTAAATATTAACGGCTTCTGGTA 

TTTTTTAGAGCTAGTATATACTTTCGTGGTAGACGTCGCTGGTATTTAAGCCA 

GTAAGATTCAGCCACACTGACAAAGAAAATATTCGTGAAAATTCTGCATACG 

GAAAGAAGAAAATTCGAGCAACAGAAAGCCAACACAATCCACAAAAATGTC 

TTTATTCGGAGCGTTGATGGGTGATTTCGACGACGATCTCGGCCTTATGAAGT 

AAGTACCAAATGGCGCAAAAAAAAACTAAATAAATGCGGCTCGCCCCGCAG 

AAGCCCCATATATTTCCATACGTGTGCAGCTAACGAAGCCCTCTTGGGGCGTG 

GAAAAACAGCCAAATAATCGCAAAACAAGGTGTAAATCATTAATTGGCCCAT 

AGGCACACAATTAGGCCAATTAAACATATTTACGTGCCCAAAAATTAGCAAT 

AAATAGCGTGCCAAAATTAACAGTAACCATCGGAGTGTGCGTGTGTGTGTGT 

GCGCAGCATGCGTGAAGTGAAGACGTAATAATCGATAATTTGAATCGAGCGA 

CCGCAGGGAAATGGAATTGGGGAAAATGCACTAGCAGGCGTTATTTCAAAGG 

TTTCGCCCTGTCACTGGGACTTTTGATAAGGCCCAACCGCAAAGTGACCCATG 

TAAAGGCAGGCTATCAGACCCTATTTTATGTATATACGTAGGCTACGCTGCCT 

TTATCACTATACTGCGATATTTGGCCACAAGTCATTTAGTTTGGCTTTGTTTAA 

AACTTAATTTCGGCTCAGTTTAAAATGAAACAAAAACGTAAAAGCAAATCAA 

ACCGTTCACAAATGGAGCTCCAGTAACTCGCACATCAGTCAAGTATCACTAA 

GTTACTCATCTTTCGTTTGCAG 



